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(54) VON WILLEBRAND FACTOR (VWF)-CLEAVING ENZYME 



(57) This invention is intended to isolate and identify 
a vWF-specific cleaving protease. 

The vWF-specific cleaving protease cleaves a bond 
between residues Tyr 842 and Met 843 of vWF and com- 
prises a polypeptide chain having Leu-Leu-Val-Ala-Val 
as a partial sequence, and more preferably comprises 
a polypeptide chain having the partial N-terminal amino 
acid sequence of a mature protein, Ala-Ala-Gly-Gly-lle- 



Leu-His-Leu-Glu-Leu-Leu-Val-Ala-Val, and having a 
molecular weight of 1 05 to 1 60 kDa in SDS-PAGE under 
reducing or non-reducing conditions. Isolation and iden- 
tification of this vWF-specific cleaving protease have led 
to the possibility of replacement therapy for patients 
having diseases resulting from a deficiency of the pro- 
tease, such as thrombotic thrombocytopenic purpura. 
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Description 

Technical Field 

5 [0001] The present invention relates to a plasma protein related to the field of medical drugs. More particularly, the 
present invention relates to a protease that specifically cleaves von Willebrand factor (it may be hereafter referred to 
as "vWF"), which is associated with blood coagulation. The vWF-cleaving protease of the present invention enables 
replacement therapy for patients with diseases resulting from defects or decreases in this protease, such as thrombotic 
thrombocytopenic purpura (it may be hereafter referred to as "TTP"). In addition, the use thereof as a novel antiplatelet 

10 thrombotic agent is expected. 

Background Art 

[0002] vWF is produced in vascular endothelial cells or megakaryocytes, and is a blood coagulation factor in which 
15 a single subunit comprising 2,050 amino acid residues (monomers of about 250 kDa) are bound by an S-S bond to 
form a multimer structure (with a molecular weight of 500 to 20,000 kDa). The level thereof in the blood is about 10 
jug/ml, and a high-molecular-weight factor generally has higher specific activity. 

[0003] vWF has two major functions as a hemostatic factor. One of the functions is as a carrier protein wherein vWF 
binds to the blood coagulation factor VIII to stabilize it. Another function is to form platelet plug by adhering and ag- 

20 glomerating platelets on the vascular endothelial subcellular tissue of a damaged vascular wall. 

[0004] Thrombotic thrombocytopenic purpura is a disease that causes platelet plug formation in somatic arterioles 
and blood capillaries throughout the whole body. In spite of recent advances in medical technology, the morbidity 
associated with this disease approximately tripled from 1 971 to 1 991 . Pathologically, TTP is considered to result from 
vascular endothelial cytotoxicity or vascular platelet aggregation. Immunohistologically, a large amount of vWFs are 

25 recognized in the resulting platelet plugs, and vWF is considered to play a major role in causing them. A normal or 
high-molecular-weight vWF multimer structure is dominant in a TTP patient, and an unusually large vWF multimer 
(ULvWFM) or large vWF multimer (LvWFM) is deduced to play a major role in accelerating platelet aggregation or 
microthrombus formation under high shearing stress. In contrast, vWF was known to degrade at a position between 
residues Tyr 842 and Met 843 by the action of vWF-cleaving protease in the circulating blood of a healthy person under 

30 high shearing stress. Accordingly, TTP is considered to occur in the following manner. The protease activity in the 
plasma is lowered for some reason, and ULvWFM to LvWFM are increased to accelerate platelet aggregation. This 
forms platelet plugs in blood vessels. 

[0005] Recently, Furlan et al. (Blood, vol. 87, 4223-4234: 1996, JP Patent Publication (Kohyo) No. 2000-508918) 
and Tsai et al. (Blood, vol. 87, 4235-4244: 1996) developed a method for assaying vWF-specific cleaving protease. In 
35 their report, this protease activity was actually lowered in TTP. The aforementioned authors reported that this enzyme 
was metalloprotease in the plasma and partially purified. However, they have not yet succeeded in the amino acid 
sequencing which would specify the protease. There have been no further developments since then. 

Disclosure of the Invention 

40 

[0006] Up to the present, plasmapheresis therapy has been performed for treating patients who congenitally lack 
vWF-specific cleaving protease and patients who had acquired positive antibodies against this protease. Establishment 
of replacement therapy using purified products or a pure substance such as a recombinant gene product of the afore- 
mentioned protease is desired. Familial TTP patients congenitally lack vWF-specific cleaving protease, and non-familial 

45 TTP is caused by posteriori production of autoantibodies against the aforementioned protease. Accordingly, replace- 
ment therapy for this protease is preferable for familial TTP patients (plasma administration is actually performed), and 
removal of autoantibodies by plasmapheresis and substitution of this protease are necessary for non-familial TTP. 
Further, the use of this protease as a novel antiplatelet thrombotic agent can also be expected. 
[0007] As mentioned above, however, Furlan et al. (Blood, vol. 87, 4223-4234: 1996, JP Patent Publication (Kohyo) 

50 No. 2000-508918) and Tsai et al. (Blood, vol. 87, 4235-4244: 1996) have suggested that the vWF-cleaving protease 
was metalloprotease in the plasma. It was reported to be partially purified, and concentrated 1 ,000- to 1 0,000-fold from 
the plasma in terms of its specific activity. Even under these conditions, there has been no advancement in the analysis 
of the properties of this protease, such as the amino acid sequence of its protein, over the period of roughly 5 years 
that has passed since then. No specific biological information has yet been obtained regarding this protease. As re- 

55 ported by Furlan et al., the protein of interest is supposed to be gigantic, and there may be various problems associated 
therewith. For example, diversified forms of this protease, such as various interacting molecules or cofactors, are 
expected. Based on the complexity of purification processes, deteriorated capacity of separation by nonspecific inter- 
action during the purification step, and other factors, it is deduced to be very difficult to isolate and identify the protease 
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from a plasma faction by the purification process according to Furlan et al. 

[0008] Under the above circumstances, the present inventors have conducted concentrated studies in order to isolate 
and identify the vWF-cleaving protease. As a result, they have succeeded in isolating and purifying the vWF-cleaving 
protease of interest, which had not yet been reported. Thus, they have succeeded in identifying an amino acid sequence 

5 of the mature protein and a gene encoding this amino acid sequence. 

[0009] The vWF-cleaving protease of the present invention can cleave a bond between residues Tyr 842 and Met 
843 of vWF. According to one embodiment, this protease has a molecular weight of 1 05 to 1 60 kDa or 1 60 to 250 kDa 
in SDS-PAGE under reducing or non-reducing conditions. It is comprised of a polypeptide chain having Leu-Leu-Val- 
Ala-Val as a partial sequence. More preferably, it is comprised of a polypeptide chain having the partial N-terminal 

10 amino acid sequence of a mature protein, i.e., Ala-Ala-Gly-Gly-De-Leu-His-Leu-Glu-Leu-Leu-Val-Ala-Val. It is a novel 
substance characterized by the following properties. 

1) vWF-cleaving activity 

15 [0010] According to the N-terminal sequence analysis of the cleavage fragment, the protease of the present invention 
cleaves a peptide bond between residues Tyr 842 and Met 843. 

2) Fractionation by gel filtration 

20 [0011] When fractionation is performed by gel filtration chromatography using Fl paste as a starting material, most 
activities are collected in a fraction with a molecular weight of 150 to 300 kDa. According to one embodiment of the 
present invention, an actually obtained active substance is found to have a molecular weight of about 1 05 to 1 60 kDa 
in electrophoresis. Accordingly, the protease of the present invention is a substance that is likely to form a dimer or 
the like or to bind to another molecule or a substance that can be easily degraded or can have a heterogeneous sugar 

25 chain added. 

3) Ammonium sulfate precipitation 

[0012] For example, when Fl paste is used as a starting material, a large portion of this protease is recovered as a 
30 precipitation fraction from a roughly purified fraction with the use of 33% saturated ammonium sulfate. 

4) SDS-PAGE 

[0013] For example, the protease of the present invention derived from Fl paste prepared from pooled human plasma 
35 or cryoprecipitate mainly has a molecular size of about 105 to 160 kDa determined by a molecular weight marker in 
SDS-PAGE. Based on the nucleic acid sequence as shown in SEQ ID NO: 15, when an amino acid sequence repre- 
sented by a frame between an atg initiation codon at position 445 and a tga termination codon at position 4726 is 
expressed by gene recombination, there are some variations in molecular sizes depending on a host. However, a 
molecular size of about 1 60 to 250 kDa determined by a molecular weight marker is exhibited. This size is observed 
40 in the plasma of healthy humans and in that of some TTP patients. Several molecular species of this protease are 
present in human plasma, caused by the presence of alternative splicing products (SEQ ID NOs: 16 to 21) recognized 
at the time of gene cloning, differences in post-translational modification such as sugar chain addition, or degradation 
during purification. Further, this protease could be partially recovered in an active state after SDS-PAGE under non- 
reducing conditions. 

45 

5) Analysis of amino acid sequence 

[0014] The amino acid sequence of the isolated polypeptide fragment was analyzed. This presented an example of 
a polypeptide chain having a sequence Leu-Leu-Val-Ala-Val as a partial amino acid sequence and a sequence Ala- 

50 Ala-Gly-Gly-lle-Leu-His-Leu-Glu-Leu-Leu-Val-Ala-Val as a N-terminal amino acid sequence of a mature protein. Fur- 
ther, with current bioinformatics (BIOINFORMATICS: A Practical Guide to the Analysis of Genes and Proteins, edited 
by Andreas D. Baxevanis and B. F. Francis Ouellette), a nucleic acid sequence encoding the amino acid sequence 
was highly accurately identified by searching a database based on the aforementioned partial sequence. More spe- 
cifically, the genome database was searched by the tblastn program. This identified a chromosome clone (AL1 58826) 

55 that is deduced to encode the protease of the present invention. Further, clones (AI346761 and AJ011374) that are 
deduced to be a part of the protease of interest and a part of the polypeptide to be encoded by the aforementioned 
genome were identified through collation with the Expressed Sequence Tag (EST) database. Based thereon, the amino 
acid sequence as shown in SEQ ID NO: 3 or 7 was identified as an active vWF-cleaving protease site. 
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[0015] GCT GCA GGC GGC ATC CTA CAC CTG GAG CTG CTG GTG GCC GTG, a sequence deduced from the 
genome, and more preferably CTG CTG GTG GCC GTG, a portion thereof, the transcriptome of which was confirmed 
by EST, was obtained. The obtained nucleotide sequence was analyzed, and motif analysis was carried out based on 
the deduced sequence. As a result, it was found to have a metalloprotease domain as a candidate for the protease of 

5 the present invention. Based on the above findings, it became possible to disclose a sequence of a polypeptide chain 
as a more specific example of the protease. Also, activities of proteases are generally known to vary depending on, 
for example, substitution, deletion, insertion, or introduction of point mutation into a portion of the amino acid sequence 
(Blood coagulation factor VII mutants, Soejimaet al., JP Patent Publication (Kokai) No. 2001-61479 A). Similarly, the 
protease of the present invention, can be modified by, for example, deletion, substitution, or addition of one or several 

10 amino acids, to prepare optimized proteases. 

[0016] The protease proteins were further mass-produced, and 29 amino acid sequences from the N-terminus were 
determined. These amino acid sequences are shown in SEQ ID NO: 8. This result is substantially the same as the 
sequence as shown in SEQ ID NO: 3 or 7 deduced by bioinformatics. Only one difference is that the amino acid 27th 
in SEQ ID NO: 3 or 7 was Glu while it was Arg according to the present analysis of the N-terminal sequence. This was 

15 considered to be a gene polymorphism. Thus, this protease was confirmed to be comprised of a polypeptide chain 
having the amino acid sequence as shown in SEQ ID NO: 3 or 7 at its N-terminus as a mature unit. A gene fragment 
encoding this protease was then cloned in the following manner. 

[0017] Based on the nucleic acid sequence as shown in SEQ ID NO: 7, a sense primer (SEQ ID NO: 9) and an 
antisense primer (SEQ ID NO: 1 0) were prepared based on the nucleic acid sequence underlined in Fig. 9, and a gene 

20 sandwiched between these primers was amplified. This fragment was cloned, and the nucleotide sequence was then 
confirmed. This fragment was used as a probe for Northern blotting to analyze the site at which the protease gene was 
expressed. As a result, this protease gene was found to be expressed mainly in the liver. Accordingly, the human liver 
cDNA library was purchased, and a gene encoding this protease was identified using a rapid amplification of cDNA 
ends (RACE) technique. Based on these results, in the case of the largest sequence of approximately 5 kb of mRNA 

25 (cDNA) reaching the poly(A) addition site as shown in SEQ ID NO: 15 was identified. 

[0018] Based on the amino acid sequence deduced from this gene sequence, this protease was deduced to have a 
preprosequence, and to belong to the disintegrin and metalloprotease (ADAM) family having a disintegrin-like domain, 
a metalloprotease domain, and the like, and particularly to the ADAM-TS family having a thrombospondin Type-1 (TSP- 
1 ) domain. Finally, including those having insertion or deletion in a part of the nucleic acid sequence, isoforms as shown 

30 in SEQ ID NOs: 16 to 21 having sequences as shown in SEQ ID NOs: 3 and 7 at the N-terminuses after the mature 
preprosequence has been cleaved were identified. Thus, the protease of the present invention should cleave vWF 
between residues Tyr 842 and Met 843 and should have the Leu-Leu-Val-Ala-Val sequence as a partial amino acid 
sequence. 

[0019] The vWF-cleaving protease of the present invention can be generally prepared by the following process. 

35 [0020] According to the present invention, a process for assaying the protease activity is characterized by the pos- 
sibility of evaluating activity within a short period of time. According to the report by Furlan et al. (Blood, vol. 87, 
4223-4234: 1996, JP Patent Publication (Kohyo) No. 2000-50891 8 A), activity is assayed by analyzing vWF-cleaving 
patterns by Western blotting using the anti-vWF antibody, and thus, it takes time to transfer the protease to a filter. 
More specifically, this process requires approximately at least 45 hours in total, i.e., 24 hours for the enzymatic reaction 

40 with a substrate vWF, 1 7 hours for electrophoresis, and 3 hours to transfer the protease to a filter, followed by detection 
using the anti-vWF antibody. In contrast, the present inventors completed activity assay in 18 hours in total, i.e., 16 
hours for the enzymatic reaction with a substrate vWF, and 2 hours for electrophoresis and detection. This indicates 
that the time required for the assay can be reduced to one third or less of that required for the conventional assay. This 
can also shorten the time required for the purification process, and in turn can lower the degree of the protease to be 

45 inactivated. Accordingly, purification efficiency is improved compared with that attained by the method of Furlan et al., 
and as a result, the degree of purification is also enhanced. 

[0021 ] Further, the starting material was examined using the aforementioned assay system. As a result, it was found 
that the protease activity was more concentrated in Fl paste than in the cryoprecipitate that had been reported by 
Furlan et al. in the past. Fl paste was used as a starting material, and the aforementioned rapid activity assay systems 
50 were combined. This enabled isolation and identification of the protease of interest. In a specific embodiment, a puri- 
fication process combining gel filtration chromatography with ion exchange chromatography is employed, and the 
aforementioned activity assay system is also combined. 

[0022] More specifically, Fl paste is solubilized with a buffer, and the resultant is fractionated by gel filtration chro- 
matography. The protease activity is fractionated at the elution region with a molecular weight of 150 to 300 kDa 
55 deduced from the size marker of gel filtration. Thereafter, the resultant is precipitated and concentrated using 33% 
saturated ammonium sulfate. This procedure is repeated three times in total. The active fraction obtained in the third 
gel filtration is pooled, and the resultant is subjected to dialysis at 4°C overnight with a buffer comprising 50 mM NaCI 
added to 50 mM Tris-HCI (pH 7.1). Thereafter, the dialysis product is subjected to anion exchange chromatography 



4 



EP 1 391 516 A1 



(DEAE) and eluted stepwise with 0.25 M NaCI. The present inventors have conducted concentrated studies in order 
to find a process for isolating and identifying the protease of the present invention. As a result, they found that, sur- 
prisingly, the protease was recoverable as an active band after non-reducing SDS-PAGE. In order to achieve further 
mass production, the purified and concentrated fraction was applied to the Biophoresis utilizing the principle of 
SDS-PAGE. Thus, a fraction having vWF-cleaving activity was isolated from the electrophoresed fraction. According 
to the approximate calculation of the specific activity up to this phase, purification of about 30,000- to 1 00,000-fold was 
achieved. This procedure was efficiently and rapidly repeated several times, and thus, about 0.5 pmole of sample that 
is the current limit of the analysis of amino acid sequence was obtained. Thus, analysis of amino acid sequence became 
feasible. More specifically, a final step of separation and purification (Biophoresis) based on the principle of SDS-PAGE 
is important, and it is based on the findings as a result of concentrated studies, which had led to the completion of the 
present invention. 

[0023] According to the report by Furlan et al., specific activity was improved by as much as about 10,000 times, 
although the protease was not substantially isolated or identified. This could be because of deactivation during purifi- 
cation or the difficulty of isolating and identifying molecules, which were gigantic proteins capable of interacting with 
various other proteins such as the protease of the present invention by a separation method utilizing various types of 
liquid chromatography. Further, the protease content in the plasma was deduced to be very small, and thus, it was 
necessary to await the establishment of the process according to the present invention. Furthermore, the use of this 
process enables the purification of recombinant genes. 

[0024] Based on the findings of the present invention, peptides or proteins prepared from the obtained sequences 
are determined to be antigens. With the use thereof, a monoclonal antibody, a polyclonal antibody, or a humanized 
antibody thereof can be prepared by general immunization techniques (Current Protocols in Molecular Biology, Antibody 
Engineering: A PRACTICAL APPROACH, edited by J. McCAFFERTY et al. or ANTIBODY ENGINEERING second 
edition, edited by Carl A. K. BORREBAECK). Alternatively, an antibody that binds to the aforementioned protein can 
be prepared by antibody-producing techniques utilizing phage display (Phage Display of Peptides and Proteins: A 
Laboratory Manual, edited by Brian K. Kay et al., Antibody Engineering: A PRACTICAL APPROACH, edited by J. 
McCAFFERTY et al. or ANTIBODY ENGINEERING second edition, edited by Carl A. K. BORREBAECK). Alternatively, 
based on these techniques, a neutralizing antibody acting against the protease activity or a simple binding antibody 
can be isolated from a specimen from a TTP patient who has an autoantibody positive against this protease. These 
antibodies can be applied to diagnosis and therapy of diseases such as TTP. 

[0025] Based on the obtained genome or EST sequence, cDNA or a genomic gene encoding the protease of the 
present invention can be cloned by a common technique (Molecular Cloning, 2nd edition). Further, bioinformatics 
techniques (BIOINFORMATICS: A Practical Guide to the Analysis of Genes and Proteins, edited by Andreas D. Bax- 
evanis and B. F. Francis Ouellette) enable cloning of the proteins of other animal species that are homologous thereto, 
and the resultant gene is fractured by a common technique (for example, Gene Targeting: A Practical Approach, First 
Edition, edited by A. L. Joyner, Teratocarci nomas and embryonic stem cell a practical approach) to produce TTP-like 
animal models. In particular, the identification of the gene sequence encoding the protein derived from a mouse enables 
the production of a knockout mouse having this gene. Thus, a disease mouse model of congenital TTP or the like can 
be prepared. 

[0026] In accordance with a common technique (for example, J. Sambrook et al., Molecular Cloning, 2nd edition, or 
CURRENT PROTOCOLS IN MOLECULAR BIOLOGY), these genes are incorporated into a suitable expression vector, 
the resultant is transformed into a suitable host cell, and the gene recombinant product of the protease can be thus 
prepared. In this case, the gene to be incorporated is not necessarily the one that encoded the entire region of the 
protein. It also includes a partial expression of the protein as defined by a domain depending on its usage. 
[0027] For example, the polynucleotide according to the present invention is introduced into a host cell using a con- 
ventional technique such as transduction, transfection, or transformation. The polynucleotide is introduced solely or 
together with another polynucleotide. Another polynucleotide is introduced independently, simultaneously, or in com- 
bination with the polynucleotide of the present invention. 

[0028] For example, the polynucleotide of the present invention is transfected in a host cell, such as a mammalian 
animal cell, by a standard technique for simultaneous transfection and selection using another polynucleotide encoding 
a selection marker. In this case, the polynucleotide would be generally stably incorporated in the genome of the host cell. 
[0029] Alternatively, the polynucleotide may be bound to a vector comprising a selection marker for multiplication in 
a host. A vector construct is introduced to a host cell by the aforementioned technique. In general, a plasmid vector is 
introduced as DNA of a precipitate, such as a calcium phosphate precipitate, or a complex with a charged lipid. Elec- 
troporation is also employed for introducing the polynucleotide into a host. When the vector is a virus, this virus is 
packaged in vitro or introduced into a packaging cell, thereby introducing the packaged virus into a cell. 
[0030] Extensive techniques that are suitable for producing a polynucleotide and introducing the resulting polynu- 
cleotide to a cell in accordance with this embodiment of the present invention are known and common in the art. Such 
techniques are described in Sambrook et al. (aforementioned), and this document explains a variety of standard 'ex- 
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perimental manuals describing the aforementioned techniques in detail. In respect of this embodiment of the present 
invention, the vector is, for example, a plasmid vector, a single- or double-stranded phage vector, or a single- or double- 
stranded RNA or DNA viral vector. Such a vector is introduced into a cell as a polynucleotide, and preferably as DNA 
by a common technique for the introduction of DNA or RNA into a cell. When the vector is a phage or virus, the vector 
5 is preferably introduced to the cell as a packaged or sealed virus by a known technique for infection and transduction. 
A viral vector may be of a replication-competent or defective type. 

[0031] A preferable vector is a vector which expresses the polynucleotide or polypeptide of the present invention in 
points. In general, such a vector comprises a cis-action control region that is effective for the expression in a host 
operably bound to the polynucleotide to be expressed. When a suitable trans-action factor (for example, a group of 
10 proteases involved with the post-translational processing such as signal peptidase or Furin) is introduced in a host 
cell, it is supplied by a host, a complementary vector, or the vector itself. 

[0032] In a preferable embodiment, a vector provides specific expression. Such specific expression is an inducible 
one or realized only in a certain type of cell. Alternatively, it is an inducible and cell-specific expression. A particularly 
preferable inducible vector can induce expression by an easily operable environmental factor such as temperature or 
15 a nutritional additive. Various vectors suitable for this embodiment including a construction for the use in prokaryotic 
and eukaryotic cell hosts and an inducible expression vector are known, and persons skilled in the art can commonly 
use them. 

[0033] A genetically engineered host cell can be cultured in general nutrient medium, and it is modified to be partic- 
ularly suitable for activation of promoter, selection of transformant, or amplification of a gene. In general, it would be 
20 obvious to persons skilled in the art that conventional culture conditions such as temperature or pH level for host cells 
selected for the expression are suitable for the expression of the polypeptide of the invention. 

[0034] A wide variety of expression vectors can be used for expressing the polypeptide of the present invention. 
Examples of these vectors include chromosome, episome, and virus-derived vectors. These vectors are derived from 
bacterial plasmid, bacteriophage, yeast episome, yeast chromosome element, or viruses such as baculovirus, papo- 

25 vavirus such as simian virus 40 (SV40), vaccinia virus, adenovirus, fowlpox virus, pseudorabies virus, or retrovirus. A 
vector derived from a combination of the aforementioned, for example, a vector derived from plasmid and bacteriophage 
gene element, more specifically, a cosmid or phagemid, may also be used. They are used for the expression in ac- 
cordance with this embodiment of the present invention. In general, since polypeptides were expressed in hosts, any 
vector that is suitable for maintaining, multiplying, or expressing a polynucleotide can be used for the expression ac- 

30 cording to the aforementioned embodiment. A suitable DNA sequence is inserted into a vector by various conventional 
techniques. In general, a DNA sequence for expression is bound to an expression vector by cleavage of a DNA se- 
quence and an expression vector having 1 or more restriction endonucleases, and a restriction fragment is then bound 
together using T4 DNA ligase. Restriction and ligation techniques that can be used for the above purpose are known 
and common to persons skilled in the art. With regard thereto, Sambrook et al. (aforementioned) very precisely describe 

35 another suitable method for constructing an expression vector utilizing another technique known and common to per- 
sons skilled in the art. 

[0035] A DNA sequence in the expression vector is operably bound to, for example, a suitable expression-regulating 
sequence including a promoter to orient the mRNA transcription. A few examples of known representative promoters 
are the phage lambda PL promoter, E, coli lac, trp, trc, and tac promoters, SV40 early and late promoters, and the 

40 retrovirus LTR promoter. Many promoters that are not described are suitable for the use according to the embodiment 
of the present invention, known, and more easily used as described in the examples of the present invention. In general, 
an expression construct comprises a ribosome binding site for translation in a transcription initiation or termination site 
or a transcribed domain. The coding region of the mature transcript that was expressed by the construct comprises 
the initiation AUG at the initiation and termination codons located substantially at the terminus of polypeptide to be 

45 translated. In addition, the construct comprises a regulator region that regulates and induces the expression. In general, 
such a region is activated through the regulation of the repressor binding site, transcription of an enhancer, or the like 
in accordance with various conventional methods. 

[0036] Vectors for multiplication and expression include selection markers. Such markers are suitable for multiplica- 
tion, or they comprise additional markers for the above-stated purpose. The expression vector preferably comprises 

50 one or more selection marker genes to provide phenotypic traits for the purpose of selecting the transformed host cell. 
A preferable marker includes dihydrofolate reductase- or neomycin-resistance with regard to eukaryotic cell culture. It 
has tetracycline- or ampicillin-resistance with regard to E. coli and other bacterial cultures. A suitable vector comprising 
a DNA sequence and a suitable promoter or regulatory sequence as described herein are introduced to a suitable host 
by various suitable known techniques for the expression of the polypeptide of interest. 

55 [0037] Representative examples of suitable hosts include: bacterial cells such as E. coli, Streptomyces, and Salmo- 
nella typhi murium; fungal cells such as a yeast cell; insect cells such as drosophila S2 and Spodoptera Sf9 cells; and 
adhesive or floating animal or plant cells such as CHO, COS, Bowes melanoma cells, and SP2/0. Various hosts for 
expression constructs are known, and persons skilled in the art can easily select a host for expressing polypeptides 
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in accordance with this embodiment based on the disclosure of the present invention. 

[0038] More specifically, the present invention includes a recombinant construct, such as an expression construct 
comprising one or more sequences as mentioned above. The construct is a vector, such as a plasmid or viral vector 
comprising the sequence of the present invention inserted therein. The sequence is inserted in a positive or negative 

5 direction. In a preferable specific example thereof, the construct further has a regulatory sequence comprising a pro- 
moter or the like that is operably bound to the sequence. Various suitable vectors and promoters are known to persons 
skilled in the art, and there are many commercially available vectors that are suitably used in the present invention. 
[0039] Commercially available vectors are exemplified below. Vectors that are preferably used for bacteria are 
pQE70, pQE60, and pQE-9 (Qiagen); pBS vector, PhageScript vector, Bluescript vector, pNH8A, pNH16a, pNH18A, 

10 and pNH46A (Stratagene); and ptrc99a, pKK223-3, pKK233-3, pDR540, and pRIT5 (Pharmacia). Examples of prefer- 
able eukaryotic vectors are pWLNEO, pSV2CAT, pOG44, pXT1 , and pSG (Stratagene) and pSVK3, pBPV, pMSG, and 
pSVL (Pharmacia). These vectors are commercially available for persons skilled in the art to be used in accordance 
with the embodiment of the present invention, and they are merely a list of known vectors. For example, other plasmids 
or vectors suitable for introducing, maintaining, multiplying, or expressing the polynucleotide or polypeptide of the 

15 present invention can also be used in hosts in accordance with this embodiment of the present invention. 

[0040] A promoter region can be selected from a gene of interest using a vector comprising, for example, a candidate 
promoter fragment, i.e., a reporter transcription unit lacking a promoter region such as a chloramphenicol acetyl trans- 
ferase (CAT) transcription unit located downstream of restriction sites for introducing promoter-containing fragments. 
As known to the public, the introduction of the promoter-containing fragment into the vector at the restriction site located 

20 upstream of the cat gene generates CAT activity that can be detected by standard CAT assay. A vector that is suitable 
for this purpose is known and readily available. Examples of such vectors are pKK232-8 and pCM7. Accordingly, the 
promoter for expressing the polynucleotide of the present invention includes not only a readily available known promoter 
but also a promoter that can be readily obtained using a reporter gene in accordance with the aforementioned technique. 
[0041] Among them, according to the present invention, examples of known bacterial promoters that are suitably 

25 used to express polynucleotides and polypeptides are E. coli lacl and lacZ promoters, T3 and T7 promoters, gpt pro- 
moter, lambda PR and PL promoters, and trp and trc promoters. Examples of suitable known eukaryotic promoters 
include the Cytomegalovirus (CMV) immediate promoter, the HSV thymidine kinase promoter, early and late SV40 
promoters, a retrovirus LTR promoter such as the Rous sarcoma virus (RoSV) promoter, and a metallothionein promoter 
such as the metallothionein-l promoter. 

30 [0042] Selection of a vector and a promoter suitable for expression in a host cell is a known technique. Techniques 
necessary for the construction of expression vectors, introduction of a vector in a host cell, and expression in a host 
are common in the art. The present invention also relates to a host cell having the aforementioned construct. A host 
cell can be a higher eukaryotic cell such as a mammalian animal cell, a lower eukaryotic cell such as a yeast cell, or 
a prokaryotic cell such as a bacterial cell. 

35 [0043] The construct can be introduced in a host cell by calcium phosphate transfection, DEAE-dextran-mediated 
transfection, cationic lipid-mediated transfection, electroporation, transduction, infection, or other methods. These 
methods are described in a variety of standard laboratory manuals, such as a book by Sambrook et al. 
[0044] The construct in a host cell can be used by a conventional method, and it produces a gene product encoded 
by a recombinant sequence. Alternatively, a partial polypeptide of the present invention can be synthesized using a 

40 general peptide synthesizer. A mature protein can be expressed under the control of a suitable promoter in a mammalian 
animal, yeast, bacterial, or other cell. Also, such a protein can be produced in a cell-free translation system with the 
use of RNA derived from the DNA construct of the present invention. Suitable cloning and expression vectors for 
prokaryotic and eukaryotic hosts are described in Sambrook et al (aforementioned). 

[0045] In general, a recombinant expression vector comprises: a replication origin; a promoter derived from a highly 
45 expressed gene to orient the transcription of a downstream structural sequence; and a selection marker for bringing 
the cell into contact with a vector and isolating the vector-containing cell. A suitable promoter can be induced from a 
gene encoding glycolytic enzymes such as 3-phosphoglycerate kinase (PGK), a-factor, acid phosphatase, and heat 
shock protein. A selection marker includes E. coli ampicillin-resistant gene and S. cerevisiae trp1 gene. 
[0046] Transcription of DNA encoding the polypeptide of the present invention using a higher eukaryotic cell may be 
50 enhanced by inserting an enhancer sequence in a vector. The enhancer is generally a cis-acting element for DNA for 
enhancing the promoter transcription activity in the predetermined host cell. Examples of an enhancer include the SV40 
enhancer, the Cytomegalovirus early promoter/enhancer, the polyoma enhancer behind the replication origin, the |3- 
actin enhancer, and the adenovirus enhancer. 

[0047] The polynucleotide of the present invention encoding a heterologous structural sequence of the polypeptide 
55 of the present invention is generally inserted in a vector by standard techniques in such a manner that it is operably 
bound to the expression promoter. The transcription initiation site of the polypeptide is suitably located at the 5' site of 
the ribosome binding site. The ribosome binding site is 5' relative to AUG that initiates the translation of a polypeptide 
to be expressed. In general, an initiation codon starts from AUG and another open reading frame located between the 
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ribosome binding site and initiation AUG is not present. The termination codon is generally present at the terminus of 
the polypeptide, and the adenylation signal and the terminator are suitably located at the 3' end of the transcription 
region. 

[0048] Regarding the secretion of the translated protein in the ER lumen, in the cytoplasm, or to the extracellular 
environment, a suitable secretion signal is incorporated in the expressed polypeptide. The signal may be endogenous 
or heterologous to the polypeptide. 

[0049] Further, a prosequence subsequent to the signal sequence may be endogenous or heterologous (e.g., a 
preprosequence of another metalloprotease). 

[0050] The polypeptide is expressed in a modified form such as a fusion protein, and it includes not only a secretion 
signal but also an additional heterologous functional region. Accordingly, an additional amino acid, especially a charged 
amino acid region, or the like, is added to the polypeptide to improve stability and storage stability in the host cell during 
purification or subsequent operation and storage. Alternatively, a given region may be added to the polypeptide to 
accelerate the purification. This type of region may be removed before the final preparation of polypeptides. Induction 
of secretion or excretion, stability improvement, or facilitation of purification with the addition of a peptide portion to the 
polypeptide is a technique common and known in the art. 

[0051] Examples of prokaryotic hosts that are suitable for multiplying, maintaining, or expressing the polynucleotide 
or polypeptide of the present invention include E. cofi, Bacillus subtilis, and Salmonella typhimurium. Various types of 
Pseudomonas, Streptomyces, and Staphylococcus are suitable hosts in this respect. Furthermore, various other types 
of hosts known to persons skilled in the art can be also used. Representative examples of expression vectors that are 
useful for bacterial applications include, but are not limited to, the replication origin of bacteria derived from commercially 
available plasmid including a selectable marker and a gene element of a known cloning vector pBR322 (ATCC 37017). 
Examples of such commercially available vectors include pKK223-3 (Pharmacia Fine Chemicals, Uppsala, Sweden) 
and GEM1 (Promega Biotec, Madison, Wisconsin, USA). These pBR322 (main chain) sections are combined with a 
suitable promoter and structural sequences to be expressed. 

[0052] Host cells are suitably transformed and multiplied to the optimal cell concentration. Thereafter, the selected 
promoter is induced by a suitable means (e.g., temperature shifting or chemical inducer), and cells are further cultured. 
Typically, cells are collected by centrifugation and fractured by a physical or chemical means. The resulting crude 
extract is further purified. Microbial cells used for the protein expression can be fractured by any convenient means 
selected from a freezing-thawing cycle, ultrasonication, mechanical fracture, and the use of a cytolytic agent. These 
methods are known to persons skilled in the art. 

[0053] Various cell lines for mammalian animal cell culture can be also used for the expression. An example of a cell 
line for mammalian animal expression includes a monkey kidney fibroblast COS-6 cell described in Gluzman et al., 
Cell 23: 175 (1981). Examples of other cells that are capable of expressing compatible vectors include C127, 3T3, 
CHO, HeLa, human kidney 293, and BHK cells. Further, a floating myeloma cell line such as SP2/0 can be also used. 
[0054] A mammalian animal expression vector comprises a replication origin, a suitable promoter and enhancer, a 
necessary ribosome binding site, a polyadenylation site, splice donor and acceptor sites, a transcription termination 
sequence, and a 5' franking untranscribed sequence necessary for expression. DNA sequences derived from the SV40 
splice site and the SV40 polyadenylation site are used for the non-transformed or transcribed gene element of interest. 
An example thereof is a CAG expression vector (H. Niwa et al., Gene, 108, 193-199 (1991)). 

[0055] Based on the gene sequence of the above protease, a probe, primer, or antisense is designed by a common 
technique. The antisense technique can be used for controlling gene expression by the use of antisense DNA or RNA 
or the formation of a triple helix. This technique is described in, for example, Okano, J., Neurochem., 56: 560 (1 991 ); 
OLIGODEOXYNUCLEOTIDES AS ANTISENSE INHIBITORS OF GENE EXPRESSION, CRC Press, Boca Raton, FL 
(1988). The triple helix formation is examined in, for example, Lee et al., Nucleic Acids Research 6: 3073 (1979); 
Cooney et al., Science 241: 456 (1988); and Dervan et al., Science 251: 1360 (1991). The method is based on the 
polynucleotide bond with complementary DNA or RNA. This enables the gene diagnosis or gene therapy. 
[0056] For example, cells obtained from a patient are subjected to ex vivo genetic engineering using a polynucleotide 
such as polypeptide-encoding DNA or RNA. The resulting cells are then supplied to patients who should be treated 
with polypeptides. For example, cells can be subjected to ex vivo genetic engineering using a retrovirus plasmid vector 
comprising RNA encoding the polypeptide of the present invention. Such a technique is known in the art, and the use 
thereof in the present invention is obvious according to the description given herein. Similarly, cells are subjected to 
in vitro genetic engineering in accordance with a conventional process in respect of in vivo polypeptide expression. 
For example, the polynucleotide of the present invention is genetically engineered for expression in the replication- 
deficient retrovirus vector as mentioned above. Subsequently, the retrovirus expression construct is isolated, introduced 
to a packaging cell, and transduced using a retrovirus plasmid vector comprising RNA encoding the polypeptide of the 
present invention. Thus, the packaging cell produces infectious viral particles having a control gene. These producer 
cells are subjected to in vitro genetic engineering and then administered to patients to allow polypeptides to be ex- 
pressed in vivo. This administration method and other methods for administering polypeptides according to the present 
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invention would be clearly understood by persons skilled in the art based on the teaching of the present invention. 
[0057] Examples of the aforementioned retrovirus, from which the retrovirus plasmid vector is derived, include, but 
are not limited to, Moloney murine leukemia virus, spleen necrosis virus, Rous sarcoma virus, Harvey sarcoma virus, 
avian leukosis virus, gibbon leukemia virus, human immunodeficiency virus, myeloproliferative sarcoma virus, and 

5 mammary tumor virus. This type of vector comprises one or more promoters to express polypeptides. Examples of 
suitable promoters that can be used include, but are not limited to, retrovirus LTR, SV40 promoter, CMV promoter 
described in Miller et al., Biotechniques 7: 980-990 (1989), and other promoters (e.g., cell promoters such as a eu- 
karyotic cell promoter including, but not limited to, histone, RNA polymerase III, and (3-actin promoter). Examples of 
other viral promoters that can be used include, but are not limited to, adenovirus promoter, thymidine kinase (TK) 

10 promoter, and B1 9 Parvovirus promoter. Persons skilled in the art can readily select a suitable promoter based on the 
teaching of the present invention. 

[0058] A nucleic acid sequence that encodes the polypeptide of the present invention is under the control of a suitable 
promoter. Examples of suitable promoters that can be used include, but are not limited to, adenovirus promoter such 
as adenovirus major late promoter, heterologous promoter such as CMV promoter, respiratory syncytial virus (RSV) 

15 promoter, inducible promoter such as MMT promoter or metallothionein promoter, heat shock promoter, albumin pro- 
moter, ApoAl promoter, human globin promoter, viral thymidine kinase promoter such as herpes simplex thymidine 
kinase promoter, retrovirus LTR including the aforementioned modified retrovirus LTR, p-actin promoter, and human 
growth hormone promoter. A promoter may be of a native type that controls the gene encoding polypeptides. A retrovirus 
plasmid vector is used to transduce the packaging cell line to form a producer cell line. 

20 [0059] Examples of packaging cells to be transfected include, but are not limited to, PE501 , PA31 7, Y-2, Y-AM, PA1 2, 
T19-14X, VT-19-17-H2, YCRE, YCRIP, GP+E-86, GP+envAm12, and the DAN cell line described in Miller, Human 
Gene Therapy 1 : pp. 5-1 4 (1 990). 

[0060] A vector is transduced in a packaging cell by a means known in the art. Examples of such means include, 
but are not limited to, electroporation, the use of a liposome, and CaP0 4 precipitation. Alternatively, a retrovirus plasmid 
25 vector is sealed in a liposome or bound to a lipid to be administered to a host. A producer cell line produces infectious 
retrovirus vector particles comprising nucleic acid sequences encoding polypeptides. Such retrovirus vector particles 
are used to transduce eukaryotic cells in vitro or in vivo. 

[0061] The transduced eukaryotic cells express nucleic acid sequences encoding polypeptides. Examples of eu- 
karyotic cells that may be transduced include, but are not limited to, germinal stem cells, embryonal carcinoma cells, 
30 hematopoietic stem cells, hepatic cells, fibroblasts, sarcoblasts, keratinocytes, endothelial cells, and bronchial epithelial 
cells. 

[0062] The protease of the present invention, an antibody against this protease, an antagonist of this protease, an 
inhibitor, an agonist, an activity modifier, or the like can be diluted with physiological saline, buffer, or the like to prepare 
a formulation. Thus, a pharmaceutical composition can be obtained. The pH value of the formulation is preferably 

35 between acidulous and neutral: close to the pH level of body fluid. The lower limit thereof is preferably between 5.0 
and 6.4, and the upper limit is preferably between 6.4 and 7.4. Alternatively, the formulation can be provided in a state 
that allows storage for a long period of time, e.g., in a lyophilized state. In such a case, the formulation can be used 
by being dissolved in water, physiological saline, buffer, or the like at a desired concentration level at the time of use. 
[0063] The formulation of the present invention may comprise a pharmacologically acceptable additive, such as a 

40 carrier, excipient, or diluent that is commonly used for pharmaceuticals, a stabilizer, or pharmaceutical^ necessary 
ingredients. Examples of a stabilizer include monosaccharides such as glucose, disaccharides such as saccharose 
and maltose, sugar alcohols such as mannitol and sorbitol, neutral salts such as sodium chloride, amino acids such 
as glycine, nonionic surfactants such as polyethylene glycol, polyoxyethylene and polyoxypropylene copolymers 
(Pluronic), polyoxyethylene sorbitan fatty acid ester (Tween), and human albumin. Addition thereof in amounts of about 

45 1 to 1 0 w/v% is preferable. 

[0064] An effective amount of the pharmaceutical composition of the present invention can be administered by, for 
example, intravenous injection, intramascular injection, or hypodermic injection in one or several separate dosages. 
The dosage varies depending on symptom, age, body weight, or other factors, and it is preferably 0.001 mg to 1 00 mg 
per dose. 

50 [0065] Also, sense or antisense DNA encoding the protease of the present invention can be similarly prepared in a 
formulation to obtain a pharmaceutical composition. 

[0066] Further, the present invention includes methods for inhibiting platelet plug formation involved with heart inf- 
arction or brain infarction, methods for inhibiting arteriosclerosis, methods for preventing restenosis, reembolization, 
or infarction involved with PTCA, methods for preventing reembolization involved with PTCR, and methods for pre- 
55 venting platelet plug formation caused by HUS or 0-157 through the administration of the peptide, protein, and DNA 
of the present invention. Furthermore, the present invention includes the use of the peptide, protein, and DNA of the 
present invention in the production of pharmaceuticals for inhibiting platelet plug formation involved with heart infarction 
or brain infarction, pharmaceuticals for inhibiting arteriosclerosis, pharmaceuticals for preventing restenosis, reembol- 



9 



EP 1 391 516 A1 



ization, or infarction involved with PTCA, pharmaceuticals for preventing reembolization involved with PTCR, and phar- 
maceuticals for preventing platelet plug formation caused by HUS or 0-157. 

[0067] The peptide or protein of the present invention is used as a leading substance for amino acid modification. 
This enables the preparation of a molecule having activity that is different from that of the protease of the present 
5 invention. An example thereof is a variant molecule that can be obtained by preparing an antagonist, which is obtained 
by preparing a variant deactivated through amino acid substitution between an amino acid residue located around the 
active center in the metalloprotease domain and another amino acid, separating a molecule recognition site from a 
catalytic site, or varying one or both of these sites. 

[0068] The use of an evaluation system for the vWF-cleaving activity described herein enables the production of an 
10 antagonist/agonist. For example, an effective antagonist can be a small organic molecule, a peptide, or a polypeptide. 
An example thereof is an antibody that is bound to the polypeptide of the present invention, thereby inhibiting or elim- 
inating its activity. 

[0069] Similarly, the use of the aforementioned evaluation system for vWF-cleaving activity enables the screening 
for a compound that is capable of cleaving vWF. In such a case, the cleaving activity of the test compound may be 
15 evaluated using the aforementioned evaluation system. 

Brief Description of the Drawings 

[0070] 

20 

Fig. 1 is a diagram showing the vWF multimer structure and the point cleaved by the vWF-cleaving protease. 
Fig. 2 is a photograph showing the result of vWF multimer analysis (agarose electrophoresis). 
Fig. 3 is a photograph showing the result of SDS-PAGE (5% gel) for analyzing the vWF-cleaving activity of each 
plasma fraction under reducing conditions. 
25 Fig. 4 is a photograph showing the result of SDS-PAGE (5% gel) for analyzing the solubilized sample of fraction 

1 (F1) paste under non-reducing conditions. 

Fig. 5 is a photograph showing the result of analyzing vWF-cleaving protease fractions after being subjected to 
gel filtration chromatography three times using the solubilized sample of F1 paste as a starting material. Fig. 5A 
is a chart showing gel filtration chromatography, Fig. 5B shows the result of SDS-PAGE on fractions under non- 
30 reducing conditions, and Fig. 5C shows the results of SDS-PAGE on vWF-cleaving activity under reducing condi- 

tions. 

Fig. 6 is a photograph showing the results of analyzing vWF-cleaving protease fractions in which the fraction 
collected by gel filtration chromatography is purified by DEAE anion exchange chromatography. Fig. 6A is a chart 
showing gel filtration chromatography, Fig. 6B shows the result of SDS-PAGE (8% gel) on elution fractions under 
35 non-reducing conditions, and Fig. 6C shows the results of SDS-PAGE on vWF-cleaving activity under reducing 

conditions. In Fig. 6C, three bands indicate an intact vWF molecule (remaining uncleaved), a vWF cleavage frag- 
ment, and a vWF cleavage fragment, respectively, as in Fig. 5C. 

Fig. 7 is a photograph showing an electrophoresed fragment obtained when the vWF-cleaving protease fraction 
purified and concentrated by DEAE anion exchange chromatography is further purified by Biophoresis-based 

40 SDS-PAGE (non-reducing conditions). 

Fig. 8 is a photograph showing the result of electrophoresis on a fraction obtained by further purifying a vWF- 
cleaving protease fraction by Biophoresis-based SDS-PAGE for analyzing vWF-cleaving protease activity and 
SDS-PAGE on active fractions under reducing conditions. Fig. 8A shows the results of SDS-PAGE for analyzing 
vWF-cleaving protease activity under non-reducing conditions, and Fig. 8B shows the results of SDS-PAGE for 

45 analyzing active fractions under reducing conditions. 

Fig. 9 relates to the identification of the vWF-cleaving protease gene, which is a diagram showing primers used 
for amplifying the gene fragment for a Northern blot probe. 

Fig. 10 relates to the identification of the vWF-cleaving protease gene, which is a photograph showing Northern 
blot autoradiography. Fig. 10A shows the results obtained when the protease-encoding gene is used as a probe, 
50 and Fig. 1 0B shows the results obtained when a (3-actin probe (RNA control) is used. 

Fig. 1 1 relates to the identification of the vWF-cleaving protease gene, and is a diagram showing the locations and 
the sequences of the primers used in the RACE experiments. 

Fig. 12 is a diagram showing the locations of primers designed for cloning full-length cDNA. 
Fig. 13 is a diagram showing a process for constructing a vector containing full-length cDNA. 
55 Fig. 14 is a photograph showing the expression in various cell lines (Western blotting under reducing conditions 

using anti-FLAG antibody, where the mock is prepared by inversely inserting a gene in an expression vector). In 
Fig. 14, each lane shows the results using the indicated sample. 
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Lane 1 : Mock (host: 293 cell) 

Lane 2: vWF-cleaving protease, cDNA+FLAG (host: 293 cell) 
Lane 3: Mock (host: HepG2 cell) 

Lane 4: vWF-cleaving protease, cDNA+FLAG (host: HepG2 cell) 
5 Lane 5: Mock (host: Hela cell) 

Lane 6: vWF-cleaving protease, cDNA+FLAG (host: Hela cell) 

Fig. 15 is a photograph showing the activity assay of recombinant expression protease (analysis of vWF-cleavage 
by SDS-PAGE under non-reducing conditions, where the mock is prepared by inversely inserting a gene in an 
10 expression vector). In Fig. 15, each lane shows the results using the indicated sample. 

Lane 1 : Mock (host: Hela cell) 

Lane 2: Supernatant in which vWF-cleaving protease was expressed (host: Hela cell) 
Lane 3: Mock (host: HepG2 cell) 
15 Lane 4: Supernatant in which vWF-cleaving protease was expressed (host: HepG2 cell) 

Lane 5: Mock (host: 293 cell) 

Lane 6: Supernatant in which vWF-cleaving protease was expressed (host: 293 cell) 
Lane 7: Mock (host: BHK cell) 

Lane 8: Supernatant in which vWF-cleaving protease was expressed (host: BHK cell) 
20 Lane 9: Mock (host: COS cell) 

Lane 10: Supernatant in which vWF-cleaving protease was expressed (host: COS cell) 
Lane 1 1 : Mock (host: CHO cell) 

Lane 12: Supernatant in which vWF-cleaving protease was expressed (host: CHO cell) 

25 Fig. 1 6 is a photograph showing the result of Western blotting using an antibody established against the protease 

of the present invention, wherein Western blotting is carried out for various antiserums using the 293 cell as a host 
and a recombinant vWF-cleaving protease. In Fig. 16, each lane shows the results obtained with the use of the 
indicated sample. 

30 Lane 1 : Mouse antiserum (prepared by administering purified protein) 

Lane 2: Rabbit antiserum (prepared by hypodermically administering an expression vector to a rabbit) 
Lane 3: Untreated rabbit antiserum 

Lane 4: Rabbit antiserum (prepared by administering KLH-conjugated partial synthetic peptide) 

35 Fig. 1 7 is a photograph showing the result of Western blotting using an antibody established against the protease 

of the present invention, wherein various samples derived from human plasma and recombinant expression units 
are detected using rabbit antiserum obtained by administering full-length cDNA, of vWF-cleaving protease. In Fig. 
17, each lane shows the results obtained with the use of the indicated sample. 

40 Lane 1 : Partially purified sample derived from human plasma cryoprecipitate 

Lane 2: Purified vWF-cleaving protease derived from human plasma 
Lane 3: Gel-filtrated Fl paste sample obtained from pooled human plasma 
Lane 4: Recombinant vWF-cleaving protease (host: 293 cell) 
Lane 5: Recombinant vWF-cleaving protease (host: Hela cell) 

45 

Fig. 1 8 is a photograph showing the result of Western blotting using an antibody established against the protease 
of the present invention, wherein rabbit antiserum obtained by immunizing a rabbit with a partially synthesized 
peptide of the vWF-cleaving protease is used to confirm the vWF-cleaving protease in healthy human plasma and 
that in the plasma and gene recombinant vWF-cleaving protease of a TTP patient. In Fig. 18, each lane shows 
50 the results obtained with the use of the indicated sample. 

Lane 1 : Gel-filtrated Fl paste sample obtained from pooled human plasma 
Lane 2: Normal human plasma 1 
Lane 3: Normal human plasma 2 
55 Lane 4: Normal human plasma 3 

Lane 5: TTP patient's plasma 1 
Lane 6: TTP patient's plasma 2 

Lane 7: Recombinant vWF-cleaving protease (host: 293 cell) 
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Lane 8: Recombinant vWF-cleaving protease (host: Helacell) 

Fig. 19 is a diagram showing the result of ELISA using an antibody prepared against the vWF-cleaving protease. 
Fig. 20 is a photograph showing the result of SDS-PAGE (silver staining) analyzing each fraction of affinity purified 
5 vWF-cleaving protease using an antibody under reducing conditions. In Fig. 20, each lane shows the results ob- 

tained with the use of the indicated sample. 

Lane 1 : Applied culture supernatant (diluted 10-fold) 
Lane 2: Passed-through fraction 
10 Lane 3: Washed fraction 

Lane 4: Elution fraction 

Fig. 21 is a photograph showing the results of evaluating neutralizing activity using an antibody (SDS-PAGE for 
analyzing vWF-cleaving activity under non-reducing conditions). In Fig. 21, each lane shows the results obtained 
15 with the use of the indicated sample. 

Lane 1 : vWF-cleaving protease solution: normal rabbit serum = 1 :1 

Lane 2: vWF-cleaving protease solution: normal rabbit serum (diluted 5-fold) = 1 :1 

Lane 3: vWF-cleaving protease solution: peptide-immunized rabbit serum = 1 :1 

20 Lane 4: vWF-cleaving protease solution: peptide-immunized rabbit serum (diluted 5-fold) = 1 :1 

Lane 5: vWF-cleaving protease solution : recombinant protein-immunized rabbit serum = 1 :1 
Lane 6: vWF-cleaving protease solution : recombinant protein-immunized rabbit serum (diluted 5-fold) = 1 :1 
Lane 7: vWF-cleaving protease solution: 10mM EDTA = 1:1 
Lane 8: vWF-cleaving protease solution: buffer only = 1 :1 

25 Lane 9: buffer (without vWF-cleaving protease) : buffer =1:1 

Fig. 22 is a diagram showing the construction of an expression vector for a molecular species lacking a C-terminal 
domain. 

30 Best Modes for Carrying out the Invention 

[0071] The present invention is hereafter described in detail with reference to the following examples, although it is 
not limited to these examples. 

35 Example 1 

(Preparation of vWF) 

[0072] A plasma cryoprecipitation (2 g) was dissolved in 20 ml of buffer (0.01 % Tween-80/50 mM Tris-HCI/1 00 mM 
40 NaCI, pH 7.4), and the resultant was subjected to gel filtration using a Sephacryl S-500 HR Column (2.6 x 90 cm, 
Amersham Pharmacia) to prepare vWF. Fractions were recovered at a flow rate of 2 ml/min in amounts of 6 ml each. 
vWF was analyzed by Western blotting using a peroxidase-labeled rabbit anti-human vWF antibody (DAKO), and high- 
molecular-weight vWF fractions were pooled. The pooled fractions were subjected to multimer analysis using agarose 
electrophoresis as described below. 
45 [0073] As shown in Fig. 1 , vWF originally has a multimer structure in which vWF monomer molecules are polymerized 
with each other at their N-terminuses or at their C-terminuses, and vWF is subjected to partial hydrolysis by the vWF- 
specific cleaving protease. As a result of the analysis, as shown in Fig. 2, the purified vWF exhibited a multimer pattern 
based on agarose electrophoresis approximately equivalent to that in the plasma of a healthy person (the ladder in 
the drawing shows the electrophoresis pattern of vWF having a multimer structure, and the upper portion indicates 
so vWF with advanced polymerization). This can prepare vWF comprising substantially no impurities that degrade it, and 
this fraction was used as a substrate when assaying the vWF-cleaving activity as described below. 

Example 2 

55 (vWF-cleaving reaction) 

[0074] vWF-cleaving activity was assayed as follows. A sample comprising 1 0 mM barium chloride (final concentra- 
tion) was pre-incubated at 37°C for 5 minutes to activate protease. A buffer (15 to 20 ml, 1.5 M urea/5 mM Tris-HCI, 
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pH 8.0) was placed in a 50 ml Falcon Tube. Subsequently, a membrane filter (0.025 jam, Millipore) was floated therein, 
and 1 00 jnl of activated sample prepared by mixing with 50 jnl of vWF substrate solution was added. The resultant was 
allowed to stand in an incubator (37°C) overnight and recovered from the filter on the next day. The recovered sample 
was evaluated based on the vWF cleavage pattern as described below in the "SDS-PAGE" section. 

5 

SDS-PAGE 

[0075] SDS-5% polyacrylamide gel was autologously prepared and used. An SDS electrophoresis buffer (2 jul, in 
the presence or absence of a reducing agent, i.e., 2-mercaptoethanol) was added to 1 0 jul of the sample described in 
10 the "vWF-cleaving activity assay" section, and the resultant was boiled for 3 minutes to prepare an electrophoresis 
sample. The gel was subjected to electrophoresis at 30 mA for 1 hour and then stained with the Gel Code Blue Stain 
Reagent (PIERCE) utilizing CBB staining. As shown in Fig. 1, activity is evaluated based on the development of a 
cleavage fragment and the presence or absence of fragments remaining uncleaved under reducing or non-reducing 
conditions. This is more specifically described in Example 3 and Fig. 3 below. 

15 

Multimer analysis utilizing agarose electrophoresis 
Preparation of gel, electrophoresis 

20 [0076] Low gelling temperature agarose (Type VII, Sigma) was added to 375 mM Tris-HCI (pH 6.8) until a concen- 
tration of 1 .4% was reached, followed by heating in a microwave oven to completely dissolve the gel. Thereafter, 0.1% 
SDS was added, and the resultant was maintained at 56°C. The resultant was made to flow into a gel mold and solidified 
by cooling at 4°C overnight (running gel). The next day, high gelling temperate agarose (SeaKem) was mixed with 375 
mM Tris-HCI (pH 6.8) until a concentration of 0.8% was reached, and dissolved by boiling in a microwave oven. There- 

25 after, the resultant was maintained at 56°C (stacking gel). The gel prepared on the previous day was cleaved, leaving 
a 10-cm fraction from the end uncleaved. The aforementioned gel was made to flow into the cleaved portion, and the 
gel was made to keep flowing at 4°C for at least 3 hours, followed by solidification. Pyronin Y was added to the sample 
described in the "vWF cleaving activity assay" section above, and the gel was prepared under non-reducing conditions 
without boiling. The gel was subjected to electrophoresis at 10 mA for at least 24 hours using an SDS-PAGE buffer. 

30 

Western blotting 

[0077] After the electrophoresis, the gel was immersed in a transcription buffer (0.005% SDS, 50 mM phosphate 
buffer, pH 7.4) for 10 minutes, and the resultant was transferred to a nitrocellulose membrane using a transcription 

35 apparatus at 4°C at 0.5 A overnight. Blocking was performed using a blotting solution (5% skim milk, PBS) for 30 
minutes, and the gel was then allowed to react for at least 6 hours with the peroxidase-labeled rabbit anti-human vWF 
antibody (DAKO), which was diluted 1 , 000-fold with the blotting solution. Thereafter, the gel was washed three times 
with the blotting solution and once with PBS, and color was developed using Konica Immunostain HRP-1 000 (Konica), 
which was a substrate reaction solution for peroxidase. The purified vWF analyzed in this assay was found to have 

40 been undegraded, but was sufficiently usable as a substrate in the present invention (Fig. 2). 

Example 3 

(Preparation of vWF-cleaving protease) 

45 

[0078] Plasma was subjected to ethanol fractionation developed by Cohn. A protease having high vWF-cleaving 
activity (one with high specific activity) when protein levels in four fractions (i.e., starting plasma, cryoprecipitate, fraction 
I (Fl) supernatant, and a paste) are made equivalent to each other was selected. As shown in Fig. 3, the protease 
activity was highest in the Fl paste. The N-terminal sequence of this cleavage fragment was analyzed, and as a result, 
50 activity derived from the cryoprecipitate and the Fl paste were found to cleave the peptide bond between residues Tyr 
842 and Met 843. Thus, the Fl paste was determined to be a main starting material for purification thereafter. 

Solubilization of Fl paste 

55 [0079] The Fl paste was fractionated in fractions of 12 g each and then cryopreserved. The paste was allowed to 
melt at 4°C the day before its use. The next day, 120 ml of solubilizing buffer (0.05% azide, 50 mM Tris-HCI (pH 7.4), 
100 mM NaCI) was added at 10 mg/ml, and the mixture was stirred at 37°C for 2 hours. The product was centrifuged 
at 1 0,000 rpm for 1 0 minutes, and the supernatant was then recovered, followed by filtration with a prefilter, a 5.0 jum 
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filter, and a 0.8 |nm filter in that order. The resultant was determined to be a solubilized sample. Fig. 4 shows the result 
of SDS-PAGE of the solubilized sample. 

Gel filtration chromatography of vWF-cleaving protease 

5 

[0080] The solubilized F1 paste was applied to a Sephacryl S-300 HR Column (5 x 90 cm, Amersham Pharmacia) 
to conduct the first gel filtration. A buffer comprising 0.05 % azide, 50 mM Tris-HCI (pH 7.4), and 1 00 mM NaCI (here- 
inafter referred to as an "elution buffer"), which was the same as the solubilizing buffer, was used. The flow rate was 
5 ml/min, fractionation was initiated at 600 ml after the sample application, and fractions were recovered in amounts 

10 of 10 ml each. Fractions were subjected to the vWF-cleaving reaction, and their activities were then analyzed by 
SDS-PAGE. Fractions that exhibited protease activity were pooled, and a small amount of saturated ammonium sulfate 
was gradually added dropwide thereto until a final concentration of 33% saturation was reached. The mixture was 
further allowed to stand at 4°C overnight. The next day, the product was centrifuged at 1 0,000 rpm for 1 0 minutes, and 
an active fraction of interest was recovered as a precipitate. The procedures comprising solubilization, gel filtration, 

15 and ammonium sulfate precipitation were performed for 5 batches and the resultant was cryopreserved at -20°C. 

[0081] The ammonium sulfate precipitates (2 to 3 batches) obtained by the first gel filtration were dissolved in 50 ml 
of elution buffer, and passed through the Sephacryl S-300 HR Column (5 x 90 cm) in the same manner as in the first 
gel filtration to perform the second gel filtration. The elution buffer, conditions, operations, and the like were the same 
as those in the first gel filtration. Fractions were subjected to the vWF-cleaving reaction, and their activities were then 

20 analyzed by SDS-PAGE. Fractions with activity were pooled, and ammonium sulfate precipitation was similarly per- 
formed. These procedures were repeated two times. 

[0082] The ammonium sulfate precipitates (2 batches) obtained by the second gel filtration were dissolved in 50 ml 
of elution buffer, and applied to the Sephacryl S-300 HR Column (5 x 90 cm) in the same manner as in the first and 
the second gel filtration to perform the third gel filtration. The elution buffer, conditions, operations, and the like were 

25 the same as those in the first and the second gel filtration. Fractions were subjected to the vWF-cleaving reaction, and 
their activities were then analyzed by SDS-PAGE, followed by pooling. Fig. 5 shows SDS-PAGE for analyzing these 
fractions and that for analyzing vWF-cleaving activity. Based on the patterns of gel filtration and the data showing 
activity, the protease of the present invention was found to be eluted in the region between fraction 37 and fraction 47. 
Based on a separately conducted elution experiment for high-molecular-weight gel filtration marker (Amersham Phar- 

30 macia), this site of elution was deduced to have a molecular weight equivalent to 150 to 300 kDa. In this phase, con- 
siderable amounts of impurities were still present. 

DEAE anion exchange chromatography 

35 [0083] The pooled fraction obtained by three gel filtration operations was subjected to dialysis overnight with a buffer 
comprising 50 mM Tris-HCI and 50 mM NaCI (pH 7.1). After the dialysis, anion exchange chromatography was per- 
formed using a 5 ml HiTrap DEAE-Sepharose Fast Flow Column (Pharmacia) to conduct further purification and con- 
centration. Equilibrating and washing were performed using a buffer comprising 50 mM Tris-HCI (pH 7.1 ), and elution 
was performed using 0.25 M NaCI. The flow rate was 5 ml/min, and 5 fractions of 5 ml each were recovered and pooled. 

40 Fig. 6 shows the results of SDS-PAGE for analyzing elution fractions and those for analyzing vWF-cleaving activity. 
Based on SDS-PAGE for activity assay, the protease of the present invention having vWF-cleaving activity was con- 
siderably effectively concentrated in the elution fraction. 

Fractionation utilizing SDS-PAGE 

45 

[0084] The sample (5 ml) purified and concentrated by DEAE anion exchange chromatography was further concen- 
trated to 0.5 ml using Centricon (molecular weight cut off: 10,000 Da, Amicon). The protease of the present invention 
was isolated by Biophoresis III (Atto Corporation) utilizing SDS-PAGE. In accordance with the Laemmli method (Nature, 
vol. 227, 680-685, 1 970), a buffer for electrophoresis tanks was prepared, and developed with 8% polyacrylamide gel 

50 to recover the electrophoresis fraction. Fig. 7 shows the result of SDS-PAGE for analyzing the recovered fractions. 
The buffer used for recovery was comprised of 50 mM Tris-HCI and 10% glycerol (pH 8.8). As is apparent from Fig. 
7, this process according to the present invention has a high ability to produce separation. Fig. 8 shows the results of 
analyzing activity of a fraction further purified by electrophoresis and the results of SDS-PAGE for analyzing active 
fractions. The protease of the present invention can be recovered as an active molecule even after SDS-PAGE. When 

55 the activity of this protease in the plasma is determined to be 1 in terms of specific activity, a degree of purification of 
30,000- to 1 00,000-fold was deduced to be achieved based on the average protein content in the plasma (60 mg/ml). 
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Example 4 

(Partial amino acid sequencing) 

5 [0085] The partial amino acid sequence of the isolated protease was determined. This protease, which was isolated 
using Biophoresis, was transferred to a PVDF membrane after SDS-PAGE by a conventional technique, air-dried, and 
then subjected to analysis using the automated protein sequencer (model 492; PE Applied Biosystems). As a result, 
the vWF-cleaving protease of the present invention isolated under the above conditions was found to comprise a 
polypeptide chain having a molecular weight of 105 to 160 kDa in SDS-PAGE under reducing conditions. This protease 

10 was also found to have, as a partial sequence, Leu-Leu-Val-Ala-Val, and preferably Ala-Ala-Gly-Gly-lle-Leu-His-Leu- 
Glu-Leu-Leu-Val-Ala-Val. 

Deduction of isolated protease utilizing bioinformatics 

15 [0086] At present, bioinformatics enables the deduction of full nucleotide sequences encoding a polypeptide without 
substantial gene cloning through collation with information in the database accumulated in the past (BIOINFORMAT- 
ICS: A Practical Guide to the Analysis of Genes and Proteins, edited by Andreas D. Baxevanis and B. F. Francis 
Ouellette). Based on the partial amino acid sequencing by the aforementioned process (Ala-Ala-Gly-Gly-lle-Leu-His- 
Leu-Glu-Leu-Leu-Val-Ala-Val), the database was searched by the tblastn program. As a result, a chromosome clone 

20 (AL1 58826) that was deduced to encode the protease of the present invention was identified by genomic database 
search. Further, a part of the protease of interest as the expressed sequence tag (EST) and a clone that was deduced 
to be a part of the polypeptide encoded by the aforementioned genome (AI346761 and AJ01 1 374) were identified. The 
amino acid sequence as shown in SEQ ID NO: 3 or 7 was deduced based thereon to be an active vWF-cleaving 
protease site. 

25 

Example 5 

(Gene identification) 

30 [0087] Synthesis of all the following synthetic primers was performed by Greiner Japan Co. Ltd. by request. Further, 
reagents used for gene recombination were those manufactured by TAKARA, TOYOBO, and New England Biolabs 
unless otherwise specified. 

Preparation of a gene fragment as a Northern blotting probe 

35 

[0088] A sense primer (SEQ ID NO: 9) and an antisense primer (SEQ ID NO: 10) were prepared. PCR was carried 
out using Universal QU ICK-Clone™ cDNA (Clontech), which was a mixture of cDNA derived from normal human tissue, 
as a template and TaKaRa LA Taq with GC rich buffer. A gene sandwiched between these primers was amplified, and 
the amplified fragment was cloned using a TOPO TA cloning™ kit (Invitrogen). DNAs having the nucleotide sequence 
40 as shown in SEQ ID NO: 6 were isolated from several clones. 

[0089] A vector portion was removed from this cloned DNA by EcoRI digestion, separated and purified by agarose 
electrophoresis, and the resultant was determined to be a template for preparing probes for Northern blotting. 

Northern blotting 

45 

[0090] The gene fragment prepared above was employed as a template to prepare a radioactive probe using [oc- 32 P] 
dCTP (Amersham Pharmacia) and a BcaBEST™ labeling kit (TAKARA). Hybridization was carried out using the Human 
1 2-lane Multiple Tissue Northern Blots™ (Clontech) filter in accordance with the method described in Molecular Cloning 
2 nd Edition, pp. 9.52-9.55. Detection was carried out by autoradiography. As shown in Fig. 10, mRNA encoding the 
50 protease was expressed mainly in the liver. The size of this mRNA was found to be more than 4.4 kb. 

Isolation and identification of gene encoding the protease 

[0091 ] As a result of Northern blotting, mRNA was found to be expressed mainly in the liver. Thus, the protease gene 
55 of the present invention was isolated and identified in accordance with the RACE technique using normal human liver- 
derived poly A + RNA and Marathon-Ready™ cDNA (Clontech). 

[0092] More specifically, the first PCR was carried out as 5' RACE using normal human liver-derived Marathon- 
Ready™ cDNA in accordance with the product's manual and using the AP-1 primer attached to the kit and antisense 
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primers (SEQ I D NOs: 11 to 1 3) arbitrarily selected from the group of Gene Specific Primers (GSP) excluding the primer 
1 located in the uppermost stream as shown in Fig. 11 . Nested PCR (the second PCR) was then carried out using the 
AP-2 primer located in the inside thereof and the antisense primer located in the inside of the primer used for the first 
PCR as shown in Fig. 11 . Thereafter, TA cloning was earned out. Genes were prepared from the developed colonies 
5 in accordance with a conventional technique (Molecular Cloning 2 nd Edition, pp. 1 .25-1 .28), and nucleic acid sequences 
were decoded using an automatic DNA sequencer. The primer used for sequencing was the primer used for PCR or 
a primer located in the inside thereof. Further, the primer was designed based on the sequence determined after serial 
decoding. 

[0093] 3' RACE was started from normal human liver-derived poly A + RNA using the 3'-Full RACE Core Set (TAKA- 
10 RA), and reverse transcription was carried out in accordance with the attached manual using the attached oligo dT 
primer. The band amplified by PCR using the sense primer (SEQ ID NO:14) located at "primer 2" in Fig. 11 and the 
attached oligo dT primer was separated by agarose electrophoresis and extracted, followed by TA cloning. Genes were 
prepared from the developed colonies, and nucleic acid sequences were decoded using an automatic DNA sequencer. 
A primer used for sequencing was designed based on the sequence determined after serial decoding. 

15 

Example 6 

(Preparation of a vector comprising full-length cDNA 1) 

20 [0094] cDNA encoding the protein was subjected to one-stage PCR by, for example, using a sense primer 1 (SEQ 
ID NO: 22) comprising an Xhol restriction site and an initiation codon and an antisense primer 2 (SEQ ID NO: 23) 
comprising an Sail restriction site and a termination codon (see Fig. 1 2), using the aforementioned normal human liver- 
derived Marathon- Ready™ cDNA as a template and the TaKaRa LA Taq with GC rich buffer, followed by the afore- 
mentioned TA cloning. Thereafter, the full length of the product was confirmed using an automatic DNA sequencer. 

25 

Example 7 

(Preparation of a vector comprising full-length cDNA 2) 

30 [0095] Restriction sites Accl and Avrl I that cleaved cDNA only at one point on the inner sequence of the cDNA (SEQ 
ID NO: 15) encoding the protein were found. With the use thereof, full-length cDNA was divided into three fragments 
as shown in Fig. 12. A fragment 1 sandwiched between the sense primer 1 (SEQ ID NO: 22) and the antisense primer 
3 (SEQ ID NO: 24), a fragment 2 sandwiched between the sense primer 4 (SEQ ID NO: 25) and the antisense primer 
5 (SEQ ID NO: 26), and a fragment 3 sandwiched between the sense primer 6 (SEQ ID NO: 27) and the antisense 

35 primer 2 (SEQ ID NO: 23) were provided, respectively, in each of the above three fragments. Each fragment was 
subjected to PCR using the aforementioned normal human liver-derived Marathon-Ready™ cDNA as a template and 
TaKaRa LA Taq with GC rich buffer, followed by the aforementioned TA cloning. The full length of the product was 
confirmed using an automatic DNA sequencer. Further, the pCR 2.1 vector included in the aforementioned TA cloning 
kit was subjected to self ligation, the ligation product was cleaved with Xhol/Hindlll, ligated to a linker comprising Xhol/ 

40 Accl/Avrl l/Hindlll (prepared by annealing the synthetic DNA as shown in SEQ ID NO: 28 or 29), and the three afore- 
mentioned fragments were sequentially ligated in a conventional manner to bind them. Thus, cDNA comprising the 
entire region was prepared (see Fig. 13). 

Example 8 

45 

(Preparation of an expression vector comprising full-length cDNA: an animal cell host) 

[0096] DNA obtained in Example 6 or 7 was digested with restriction enzymes Xhol/Sall, ligated to, for example the 
Sail site in the pCAG vector (Niwa, H. et al., Gene, vol. 108, 193-199), and the direction of the insertion and the full- 
50 length sequence were confirmed using an automatic DNA sequencer. 

Example 9 

(Transfection of an expression vector comprising full-length cDNA into an animal cell) 

55 

[0097] The animal cell expression vector prepared in Example 8 was transfected in the following manner using the 
293 cell (human embryonic kidney cell line), the Hela cell, and the HepG2 cell. At the outset, cells were disseminated 
at 1 to 3 x 10 5 cells per 35 mm dish 24 hours before the transfection. The next day, 2 jnl of polyamine transfection 
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reagent, TransIT (TAKARA), per jug of the expression vector, were added to 100 jliI of a serum-free medium such as 
Opti-MEM to prepare a complex with DNA in accordance with the instructions included with the reagent. Thereafter, 
the complex was added dropwise to the various types of previously prepared cells, and the resultants were incubated 
for 2 to 8 hours, followed by medium exchange. The medium was further exchanged three days later with the selective 

5 medium to which G418 had been added. Thereafter, medium was exchanged every three days to produce a stably 
expressed strain. An example thereof is shown in Fig. 1 4 as a temporarily expressed strain comprising an FLAG epitope 
tag at its C-terminus. Detection was carried out by Western blotting using the anti-FLAG-M2 antibody (Kodack) and 
staining with anti-mouse Ig-alkaline phosphatase-labeled antibody system. The recombinant strain expressed using 
cDNA as shown in this example exhibited a molecular size of about 250 kDa under reducing conditions. This molecular 

10 size was also found in the plasma of a healthy human (Fig. 1 8, Example 1 4 below). Several different molecular species 
of this protease are found to be present in the human plasma, which could be caused by the presence of the alternative 
splicing products (SEQ ID NOs: 6 to 21) observed at the time of gene cloning, difference in post-translational modifi- 
cation such as sugar chain addition, or degradation during purification (described in Example 14 and in Fig. 17 of the 
present invention and Gerritsen etal., Blood, vol. 98, 1654-1661 (2001)). 

15 [0098] Subsequently, the vWF-cleaving activity of the recombinant strain was confirmed by the method described in 
Example 2 (Fig. 1 5). As a result, the human plasma-derived protease and the gene recombinant product of the present 
invention were found to exhibit the same vWF-cleaving activities. 

Example 10 

20 

(Preparation of an expression vector comprising partial cDNA: an E. coli host) 

[0099] Partial cDN A encoding the metalloprotease domain of the protein was subjected to PCR using a sense primer 
comprising an Ncol restriction site and an initiation codon (SEQ ID NO: 30). and an antisense primer comprising an 

25 Hindi 1 1 restriction site and a termination codon (SEQ ID NO: 31), the aforementioned normal human liver-derived Mar- 
athon-Readay™ cDNA or the cDNA obtained in Example 6 or 7 as a template, and the TaKaRa LA Taq with GC rich 
buffer. The PCR product was then digested with Ncol/Hindlll, ligated to the Ncol/Hindll I digest of an E. coli expression 
vector such as pUT1 (Soejima et al., J. Biochem. Tokyo, vol. 130, 269-277 (2001)), and transformed to the E. coli 
competent cell JM 1 09 by a conventional technique. Several clones were collected from the formed colony group, and 

30 genes were prepared therefrom. Thereafter, the resulting genes were confirmed to be the genes encoding the polypep- 
tide, wherein the nucleic acid sequence of the insertion site of the plasmid vector was equivalent to SEQ ID NO: 32 or 
substantially represented by SEQ ID NO: 33, using an automatic DNA sequencer. 

Example 11 

35 

(Expression of partial cDNA-containing expression vector in E. coli) 

[0100] An E. coli host with the expression vector constructed in Example 10 introduced therein was precultured in 
200 ml of LB medium comprising 50 jiig/ml ampicillin at 30°C overnight. The resultant was sowed in a fermenter com- 
40 prising 8 liters of LB medium, and culture was conducted at 30°C until the turbidity at 600 nm became 0.2 to 0.5. 
Thereafter, isopropyl-1 -thio-|3-D-galactopyranoside was added to a final concentration of 1 mM, and the mixture was 
further cultured overnight to induce the metalloprotease domain of the protein to be expressed. The cultured E. coli 
were collected using a centrifuge (4°C for 30 minutes). 

[0101] Subsequently, the collected E. coli pellet was resuspended in distilled water, and lysozyme (final concentra- 
45 tion: 0.6 mg/ml) was added thereto. The mixture was stirred at room temperature for 30 minutes, allowed to stand at 
4°C overnight, and cells were then destroyed. After the ultrasonication, centrifugation was carried out using a centrifuge 
(4°C for 20 minutes), and the pellet was recovered. The recovered pellet was resuspended in a buffer comprising 50 
mM Tris, 10 mM EDTA, and 1% Triton X-100 (pH 8.0). These procedures of centrifugation, ultrasonication, and resus- 
pension were repeated several times, and the pellet was then resuspended in distilled water. Similarly, procedures of 
50 centrifugation, ultrasonication, and resuspension were repeated several times to recover an inclusion body. This in- 
clusion body was used as an antigen when producing an antibody. 

Example 12 

55 (Isolation of homologous gene of other animal species) 

[0102] The nucleic acid sequence as shown in SEQ ID NO: 15 was used as a probe, and a homology search was 
conducted using the BLASTN program at the GenomeNet WWW server (http://www.genome.ad.jp/). As a result, chro- 
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mosome clones AC091762 and AC090008 that were mapped at mouse chromosome 10 were obtained. Based on 
these sequences, a mouse homolog of the protease of the present invention as shown in SEQ ID NO: 34 was deduced. 
A new primer was designed from this sequence, and Northern blot analysis was conducted by the technique used in 
isolating and identifying the gene encoding the human vWF-cleaving protease. Thus, the occurrence of the specific 
5 expression in the liver was observed as with the case of humans. Further, normal mouse liver-derived poly A+ RNA 
and Marathon-Ready™ cDNA (Clontech) were used to isolate and identify the protease gene of the present invention 
by the RACE technique as in the case of humans. As a result, the mouse homologous gene sequences of the protease 
as shown in SEQ ID NOs: 35 and 36 were determined. 

[0103] Based on the thus determined mouse homologous partial sequence, the Exon/lntron structure on the 5' side 
10 of the aforementioned mouse chromosome 10 was determined. In accordance with a conventional technique (e.g., 
Gene Targeting: A Practial Approach First Edition, edited by A. L. Joyner, Teratocarci nomas and embryonic stem cell 
a practical approach), a targeting vector for knock-out (knock-in) mice can be prepared based thereon. This enabled 
the production of mutated mice. Further, this protein can be subjected to recombinant expression by a conventional 
technique. 

15 

Example 13 

(Production of an antibody and construction of a detection system for the present protease using the antibody) 

20 [0104] In accordance with a conventional technique (e.g., Current Protocols in Molecular Biology: Chapter 11 immu- 
nology, Antibody Engineering: A PRACTICAL APPROACH, edited by J. McCAFFERTY et al. or ANTIBODY ENGI- 
NEERING second edition, edited by Carl A. K. BORREBAECK), an expression vector was administered to a mouse 
or rat. This expression vector comprises a substance prepared by optionally binding an antigen protein partially purified 
from human plasma or a synthetic peptide having a partial amino acid sequence thereof (e.g., a C-terminal peptide 

25 sequence (SEQ ID NO: 37) Phe-Ser-Pro-Ala-Pro-Gln-Pro-Arg-Arg-Leu-Leu-Pro-Gly-Pro-Gln-Glu-Asn-Ser-Val-Gln- 
Ser-Ser, which was one isoform of the protease of the present invention ) to an optimal carrier substance such as KLH 
(Cys was added to, for example, the N- or C-terminus to facilitate KLH addition), the aforementioned gene recombinant 
protein, or a gene encoding this protein. Thus, a monoclonal antibody-expressing hybridoma was established, and a 
polyclonal antibody (antiserum) was produced. 

30 [0105] Subsequently; the antibodies prepared by the various aforementioned techniques were used to detect the 
protease of the present invention by Western blotting in accordance with a conventional technique (e.g., Current Pro- 
tocols in Molecular Biology: Chapter 10 analysis of proteins, Chapter 11 immunology). More specifically, the culture 
supernatant of the recombinant unit-expressing 293 cell obtained in the procedure as described in Example 9 was 
subjected to SDS-PAGE under non-reducing conditions, transferred to a PVDF membrane, and confirmed using mouse 

35 or rabbit antiserum to confirm the expression of the genetically recombinant unit (Fig. 1 6). As a result, a band that was 
deduced to be derived from the protease of the present invention was found in a molecular size range of 160 to 250 
kDa. Subsequently, the protease of the present invention was detected using starting plasma or the like and a recom- 
binant unit under non-reducing conditions. As a result, a band was found in 105 to 160 kDa or 160 to 250 kDa (Fig. 
17). Also, a band derived from a similar recombinant unit was detected in a monoclonal antibody established by im- 

40 munizing a recombinant protein (clone No. CPHSWH-10). 

[0106] Further, the C-terminal peptide sequence Phe-Ser-Pro-Ala-Pro-Gln-Pro-Arg-Arg-Leu-Leu-Pro-Gly-Pro-Gln- 
Glu-Asn-Ser-Val-Gln-Ser-Ser (SEQ ID NO: 37), which was one isoform of the protease of the present invention, was 
bound to KLH. The resultant was used as an immunogen to obtain a peptide antibody. With the use thereof, the protease 
of the present invention was detected from the plasma of healthy persons, plasma of TTP patients, or a culture super- 

45 natant of the recombinant unit under reducing conditions. As a result, a band of approximately 250 kDa that was 
deduced to be a signal derived from the protease of the present invention was found, although it was not clear based 
on plasma derived from some TTP patients (Fig. 1 8). 

[0107] Furthermore, enzyme immunoassay (ELISA) constructed by combining the obtained antibodies enabled the 
preparation of a calibration curve that is concentration-dependent at the culture supernatant level of the recombinant 

50 protein (Fig. 19). An example of ELISA is as follows. The obtained mouse anti-vWF-cleaving protease antibody was 
immobilized on the Maxisorp plate (Nunc), and 1/1, 1/2, and 1/4 diluents of the culture supernatant of the vWF-cleaving 
protease-temporarily expressing 293 cells were allowed to react in amounts of 100 jul/well (Mock supernatant as "0"). 
The plate was subjected to reaction, for example, at 37°C for 1 hour, and then washed with 0.05% Tween 20/TBS. 
Thereafter, the 100-fold diluted rabbit anti-vWF-cleaving protease antibody was allowed to react in amounts of 100 

55 well, for example, at 37°C for 1 hour, and the plate was washed with 0.05% Tween 20/TBS. The 1,000-fold diluted 
peroxidase-labeled anti-rabbit Ig antibody (BioRad) was then allowed to react in amounts of 100 jul/well, for example, 
at 37°C for 1 hour, and the plate was washed with 0.05% Tween 20/TBS. Thereafter, color was developed for a given 
period of time using a coloring substrate TMBZ, the reaction was terminated using 1M sulfuric acid as a termination 
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liquid, and the absorbance at 450 nm was assayed. The application thereof enabled the quantification of the protease 
of the present invention in a variety of specimens. 

Example 14 

5 

(Purification of the protease using an antibody) 

[0108] The obtained antibody was bound to a suitable immobilization carrier to prepare an affinity column, and the 
resulting column was used to purify, the protease of the present invention. The affinity column was prepared by immo- 

10 bilizing an antibody using Cellulofine for NHS activation (Chisso Corporation) in accordance with the included instruc- 
tions. The thus prepared swollen carrier (about 1 ml) was used to apply the culture supernatant in which the recombinant 
gene had been expressed in the 293 cell of the protease as described in Example 9. Thereafter, the column was 
washed with 50 mM Tris-HCI and 0.1M NaCI (pH 7.5, hereafter referred to as "TBS"), and elution was carried out using 
a urea-containing 0.1 M glycine buffer (pH 3). The eluted fraction was neutralized with 1 M Tris-HCI (pH 8.5) and then 

15 dialyzed against TBS. Fig. 20 shows the results of SDS-PAGE analysis of the resulting purified protease. Also, the 
resulting purified fraction was found to have vWF-cleaving activity. The cleavage point of the vWF fragmented by this 
recombinant protease was found to be the position between residues Tyr 842 and Met 843 based on the analysis of 
the N-terminal amino acid sequence of the fragment. Also established were clones (e.g., Clone Nos. CPHSWH-7.2 
and 1 0) that could be similarly subjected to purification with the use of the monoclonal antibody prepared by the method 

20 as described in Example 13. 

[0109] Subsequently, the partial amino acid sequence of the purified protease was determined. In accordance with 
a conventional technique, the protease was subjected to SDS-PAGE, transferred to a PVDF membrane, air-dried, and 
then subjected to analysis using an automated protein sequencer (model 492; PE Applied Biosystems). As a result, 
the protease was found to comprise Ala-Ala-Gly-Gly-lle- as a partial N-terminal sequence. This sequence was con- 

25 gruous with the N-terminal sequence of the mature unit of the protease of the present invention, that was deduced 
from the genetic construction. 

Example 15 

30 (Neutralization of the protease activity using an antibody) 

[0110] Activity of the aforementioned rabbit polyclonal antibody to neutralize the vWF-cleaving protease was eval- 
uated. Normal rabbit serum, rabbit antiserum comprising the C-terminal peptide sequence (SEQ ID NO: 37), Phe-Ser- 
Pro-Ala-Pro-Gln-Pro-Arg-Arg-Leu-Leu-Pro-Gly-Pro-Gln-Glu-Asn-Ser-Val-Gln-Ser-Ser bound to KLH as an immuno- 

35 gen, and antiserum, the immunity of which had been induced by the protein expressed by the expression vector as 
shown in Example 7 or 8, were respectively allowed to pre-react at 37°C for 1 hour with 1 to 1 0 |ug/ml of gene recom- 
binant vWF-cleaving protease (approximated by the Bradford technique) at a volume ratio of 1 :1 . Alternatively, a 5-fold 
diluted antiserum was allowed to pre-react under the above conditions with the protease at a volume ratio of 1:1. 
Thereafter, vWF-cleaving activity was evaluated by the method described above. As a result, it was found that antise- 

40 rum, which had activity of inhibiting the protease of the present invention, were prepared by immunizing the protein 
(Fig. 21). (antagonist activity) (a metalloprotease inhibitor, i.e., EDTA, was determined to be a control). This indicates 
the possibility of constructing an acquired TTP patient-like model having a positive autoantibody against vWF-cleaving 
protease as well as the simple possibility of producing a neutralizing antibody. 

45 Example 16 

(Construction of C-terminus deleted modification unit) 

[0111] Based on the strategy shown in Fig. 22, the full-length vWF-cleaving protease gene cloning vector (pCR 2.1 
50 vWFCP) obtained in Example 6 or 7 was used to add a variant lacking domains located in a position following the C- 
terminus (T1135stop, W1016stop, W897stop, T581stop, and Q449stop: each numerical value indicates the number 
of amino acid residues between Met encoded by the initiation codon AGT and the termination codon, and indicates a 
site comprising the FLAG epitope (DNA sequence: gactacaaggacgatgacgataagtga (SEQ ID NO: 47) and amino acid 
sequence: Asp Tyr Lys Asp Asp Asp Asp Lys (SEQ ID NO: 48)). Primers used herein are as follows. "S" indicates a 
55 sense primer, and "AS" indicates an antisense primer. Genes Stu l-S (SEQ ID NO: 38), Acc l-S (SEQ ID NO: 39), Avr 
ll-S (SEQ ID NO: 40), Q449stop-AS (SEQ ID NO: 41), T581stop-AS (SEQ ID NO: 42), W897stop-AS (SEQ ID NO: 
43), W1016stop-AS (SEQ ID NO: 44), T1135stop-AS (SEQ ID NO: 45), and full-length-AS (SEQ ID NO: 46) were 
prepared and incorporated in the pCAG expression vector in accordance with the method as used in Examples 8 and 
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9. This expression vector was introduced in the Helacell. The primer pair shown at the bottom of the restriction map 
in the upper portion of Fig. 22 was used to obtain PCR fragments (A) to (F). Each PCR fragment was ligated to pCR 
2.1 vWFCP. Further, the resultant was digested with Stul/Sall, and fragments (A) and (B) were digested with Stul/Sall 
and then ligated. These fragments were further digested with Accl, and fragment (C) was also digested with Accl, 
followed by ligation. The ligation product was digested with Avrll/Sall, and fragments (D), (E), and (F) were also digested 
with Avrll/Sall, followed by ligation. As a result, a variant lacking a region between the C-terminus and the position 
W897 was found to have activity, although it was the result of qualitative analysis. Such a way of approach enables 
the identification of various functional domains. The design of molecules comprising these domains and having no 
protease activity is considered to realize the design of antagonists or agonists. 

Industrial Applicability 

[0112] The findings of the present invention have led to the possibility of replacement therapy for patients having 
diseases resulting from deficiency of a protease, such as thrombotic thrombocytopenic purpura. This also realizes the 
establishment of methods for gene cloning and efficient purification from serum or plasma. In particular, the information 
provided by the present invention enables gene recombination based on the obtained nucleotide sequence and stable 
production and provision of the protease according to the present invention, which have been heretofore difficult to 
achieve. Also, these can be applied to replacement therapy for TTP patients, inhibition of platelet plug formation in- 
volved with heart infarction or brain infarction, inhibition of arteriosclerosis, prevention of restenosis, reembolization, 
or infarction involved with PTCA, prevention of reembolization involved with PTCR, and prevention of platelet plug 
formation caused by HUS or 0-157. Diagnosis and therapy utilizing the gene encoding the protease of the present 
invention or an antibody thereagainst can be realized. 

[0113] All publications cited herein are incorporated herein in their entirety. A person skilled in the art would easily 
understand that various modifications and changes of the present invention are feasible within the technical idea and 
the scope of the invention as disclosed in the attached claims. The present invention is intended to include such mod- 
ifications and changes. 
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SEQUENCE LISTING 

<110> JURIDICAL FOUNDATION THE CHEMO-SERO-THERAPEUT I C RESEARCH INSTITUTE 
<120>vWF-cleaving protease 
<130> PH1553-PCT 
<160>48 



<210>1 
<211>5 
<212>PRT 

<213> Homo sapiens 
<400>1 

Leu Leu Val Ala Val 
1 5 

<210>2 

<211>14 

<212>PRT 

<213> Homo sapiens 
<400>2 

Ala Ala Gly Gly He Leu His Leu Glu Leu Leu Val Ala Val 
1 5 10 

<210>3 

<211>161 

<212>PRT 

<213> Homo sapiens 
<400>3 
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Ala Ala Gly 
1 

Pro Asp Val 
Leu Thr Asn 
Leu Gly Ala 
Thr Glu Pro 
Ser Leu Leu 
Asp Asp Thr 
Arg Phe Asp 
Val Thr Gin 
He Thr Glu 
Glu lie Gly 



Gly lie Leu 
5 

Phe Gin Ala 
20 

Leu Asn He 
35 

Gin Phe Arg 

' 50 
Glu Gly Ala 
65 

Ser Val Cys 
80 

Asp Pro Gly 
95 

Leu Glu Leu 
110 

Leu Gly Gly 
125 

Asp Thr Gly 

140 
His Ser Phe 

155 



His Leu Glu 
His Gin Lys 
Gly Ala Glu 
Val His Leu 
Pro Asn He 
Gly Trp Ser 
His Ala Asp 
Pro Asp Gly 
Ala Cys Ser 
Phe Asp Leu 
Gly Leu Glu 



Leu Leu Val 
10 

Asp Thr Glu 
25 

Leu Leu Arg 
40 

Val Lys Met 
55 

Thr Ala Asn 
70 

Gin Thr He 

85 

Leu Val Leu 
100 

Asn Arg Gin 
115 

Pro Thr Trp 
130 

Gly Val Thr 
145 

His Asp 
160 



Ala Val Gly 
15 

Arg Tyr Val 
30 

Asp Pro Ser 
45 

Val He Leu 
60 

Leu Thr Ser 
75 

Asn Pro Glu 
90 

Tyr lie Thr 
105 

VaL Arg Gly 
120 

Ser Cys Leu 
135 

lie Ala His 
150 



<210>4 

<211>15 

<212>DNA 

<213> Homo sapiens 



22 



EP 1 391 516 A1 

<400>4 

ctgctggtgg ccgtg 

<210>5 

<211>42 

<212>DNA 

<213> Homo sapiens 
<400>5 

gctgcaggcg gcatcctaca cctggagctg ctggtggccg tg 

<2i0>6 

<211>483 

<212>DNA 

<213> Homo sapiens 
<406>6 

gctgcaggcg gcatcctaca cctggagctg ctggtggccg tgggccccga tgtcttccag 
gctcaccaga aggacacaga gcgctatgtg ctcaccaacc tcaacatcgg ggcagaactg 
cttcgggacc cgtccctggg ggctcagttt cgggtgcacc tggtgaagat ggtcattctg 
acagagcctg agggtgctcc aaatatcaca gcaaacctca cctcgtccct gctgagcgtc 
tgtgggtgga gccagaccat caaccctgag gacgacacgg atcctggcca tgctgacctg 
gtcctctata tcactaggtt tgacctggag ttgcctgatg gtaaccggca ggtgcggggc 
gtcacccagc tgggcggtgc ctgctcccca acctggagct gcctcattac cgaggacact 
ggcttcgacc tgggagtcac cattgcccat gagattgggc acagcttcgg cctggagcac 
gac 

<210>7 
<21l>161 
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<212>PRT 

<2l3> Homo sapiens 
<400>7 

get gca ggc ggc ate eta cac ctg gag ctg ctg gtg gee gtg ggc 
Ala Ala Gly Gly lie Leu His Leu Glu Leu Leu Val Ala Val Gly 
1 5 10 15 

ccc gat gtc ttc cag get cac cag aag gac aca gag cgc tat gtg 
Pro Asp Val Phe Gin Ala His Gin Lys Asp Thr Glu Arg Tyr Val 

20 25 30 

etc acc aac etc aac ate ggg gca gaa ctg ctt egg gac ccg tec 
Leu Thr Asn Leu Asn He Gly Ala Glu Leu Leu Arg Asp Pro Ser 

35 40 45 

ctg ggg get cag ttt egg gtg cac ctg gtg aag atg gtc att ctg 
Leu Gly Ala Gin Phe Arg Val His Leu Val Lys Met Val lie Leu 

50 55 60 

aca gag cct gag ggt get cca aat ate aca gca aac etc acc teg 
Thr Glu Pro Glu Gly Ala Pro Asn lie Thr Ala Asn Leu Thr Ser 

65 70 75 

tec ctg ctg age gtc tgt ggg tgg age cag acc ate aac cct gag 
Ser Leu Leu Ser Val Cys Gly Trp Ser Gin Thr He Asn Pro Glu 

80 85 90 

gac gac acg gat cct ggc cat get gac ctg gtc etc tat ate act 
Asp Asp Thr Asp Pro Gly His Ala Asp Leu Val Leu Tyr He Thr 

95 100 105 

agg ttt gac ctg gag ttg cct gat ggt aac egg cag gtg egg ggc 
Arg Phe Asp Leu Glu Leu Pro Asp Gly Asn Arg Gin Val Arg Gly 

110 115 120 

gtc acc cag ctg ggc ggt gee tgc tec cca acc tgg age tgc etc 
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Val Thr Gin Leu Gly Gly Ala Cys Ser Pro Thr Trp Ser Cys Leu 
125 130 135 

att acc gag gac act ggc ttc gac ctg gga gtc acc att gcc cat 

He Thr Glu Asp Thr Gly Phe Asp Leu Gly Val Thr He Ala His 
140 145 150 

gag att ggg cac age ttc ggc ctg gag cac gac 

Glu He Gly His Ser Phe Gly Leu Glu His Asp 
155 160 



<210>8 

<211>29 

<212>PRT 

<213> Homo sapiens 
<400>8 

Ala Ala Gly Gly He Leu His Leu Glu Leu Leu Val Ala Val Gly 
15 10 15 

Pro Asp Val Phe Gin Ala His Gin Lys Asp Thr Arg Arg Tyr 
20 25 

<210>9 

<211>30 

<212>DNA 

<213> Homo sapiens 
<400>9 

getgeaggeg gcatcctaca cctggagctg 



<210>10 
<2 1 1>2 1 



25 



<212>DNA 

<213> Homo sapiens 
<400>10 

cccaatctca tgggcaatgg t 

<210>11 
<211>21 
<212>DNA 

<213> Homo sapiens 
<400>11 

cccaatctca tgggcaatgg t 



<210>12 
<211>30 
<212>DNA 

<213> Homo sapiens 
<400>12 

ccgatgttga ggttggtgag cacatagcgc 

<2lO>13 
<211>20 
<212>DNA 

<213> Homo sapiens 
<400>13 

gtgtcgtcct cagggttgat 



<2 10>14 
<211>21 
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<212>DNA 
5 <213> Homo sapiens 

<400>14 

accattgccc atgagattgg g 

10 

<210>15 
<211>4950 

15 

<212>DNA 

<213> Homo sapiens 
<400>15 

aaccacgatg tctttggcac agcctctcat 
ggagtcagcc gaggtcctgg cattccttgt 

25 

gtcccttctc cagattagat ggcttaggcc 
agagctgtct ggtgtgcagg actgggctgc 
30 caaggcccag cttgcagcag gaggagctgc 

ctggcaggaa gcttccaaga gtaaacactg 
ctgtcccatt ccatactgac cagattccca 

35 

ctcctcgggc tggctctcct gaggatgcac 
ctctgtgtgg ccggaatcct tgcctgtggc 

40 ttccagcaga gttgtcttca ggctttggag 

ggtgctccct taaaaggccg ccctccttcc 
aggcgggctg caggcggcat cctacacctg 

45 ttccaggctc accaggagga cacagagcgc 

gaactgcttc gggacccgtc cctgggggct 
attctgacag agcctgaggg tgctccaaat 

50 

agcgtctgtg ggtggagcca gaccatcaac 
gacctggtcc tctatatcac taggtttgac 
55 cggggcgtca cccagctggg cggtgcctgc 
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ctgtcagatg ggagcgggga ccccggagag 60 

gaacccccgt ctgtgggttt ctggtccagt 120 

tcctctaagg gggtgggcgt gcacatccgg 180 

aggttaccct gaactgcaac catcttagag 240 

aggccgccca ccctagccac ggcccctgcc 300 

cctaatcgtc ccgcccagta gtgagcaggc 360 

gtcaccaagg ccccctctca ctccgctcca 420 

cagcgtcacc cccgggcaag atgccctccc 480 

tttctcctgg gctgctgggg accctcccat 540 

ccacaggccg tgtcttctta cttgagccct 600 

cctggcttcc agaggcagag gcagaggcag 660 

gagctgctgg tggccgtggg ccccgatgtc 720 

tatgtgctca ccaacctcaa catcggggca 780 

cagtttcggg tgcacctggt gaagatggtc 840 

atcacagcca acctcacctc gtccctgctg 900 

cctgaggacg acacggatcc tggccatgct 960 

ctggagttgc ctgatggtaa ccggcaggtg 1020 

tccccaacct ggagctgcct cattaccgag 1080 
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crop of 1 f croo 


icgacciggg 


agtcaccai i 


gecca igaga 


[ igggcacag 


ct teggee tg 


1 1 A fi 

114U 


5 


gagcacgacg 


gcgcgcccgg 


cagcggctgc 


ggccccagcg 


gaeaegtgat 


ggc t teggae 


1200 




ggcgccgcgc 


cccgcgccgg 


cc tegectgg 


tccccc tgea 


gccgccggca 


get get gage 


1260 




ctgcicagcg 


caggaeggge 


gcgctgcgtg 


tgggacccgc 


rt ^Yrt rt? rt* rt rt ^ rt rt 

cgcggcc tea 


acccgggtcc 


1320 


10 


gcggggcacc 


cgccggatgc 


gcagcctggc 


etc tactaca 


gcgccaacga 


gcagtgccgc 


1380 




/^t 4* /^p jt rt rt t T rt at 

gtggcc l teg 


/r.rt rt rt O rt /TCT/> 

gccccaaggc 


4" /t "f - rt /Trt rt T rrn 

tgtcgcctgc 


rt a rt 4" T" rt rt? rt rt rt 

acct icgcca 


gggagcacc t 


rtrrtPrt f n f - rt* 4* rt*j-fc 

ggatatgtgc 


1 A A A 

1440 


15 


caggccctc t 


cctgccacac 


agacccgc tg 


gaccaaagca 


gctgcagccg 


cctcctcgt t 


1500 




cctctcctgg 


atgggacaga 


atgtggcgtg 


gagaagtggt 


gctccaaggg 


tcgctgccgc 


1560 




tccctggtgg 


agctgacccc 


catagcagca 


gtgcatgggc 


gctggtctag 


ctggggtccc 


1620 


20 


cgaagtcctt 


gctcccgctc 


ctgcggagga 


ggtgtggtca 


ecaggaggeg 


geagtgeaac 


1680 




aaccccagac 


ctgcctttgg 


ggggcgtgca 


tgtgttggtg 


ctgacctcca 


ggccgagatg 


1740 


25 


tgcaacactc 


aggectgega 


gaagacccag 


ctggagttca 


tgtegcaaca 


gtgegecagg 


1 8 AO 
lOUU 


accgacggcc 


agccgctgcg 


ctcctcccct 


ggcggcgcct 


ccttctacca 


ctggggtgct 


1860 




gctg.taccac 


acagecaagg 


ggatgctctg 


tgcagacaca 


tgtgccgggc 


cattggegag 


1920 


30 


agcttcatca 


tgaagcgtgg 


agacagcttc 


ctcgatggga 


cccggtgtat 


gccaagtggc 


1980 




ccccgggagg 


acgggaccct 


gagcctgtgt 


gtgtcgggca 


getgeaggae 


atttggctgt 


2040 




gatggtagga 


tggactccca 


gcaggtatgg 


gaeaggtgee 


aggtgtgtgg 


tggggacaac 


2100 


35 


ageaegtgea 


gcccacggaa 


gggctctttc 


acagctggca 


gagegagaga 


atatgtcacg 


2160 




tttctgacag 


ttacccccaa 


cctgaccagt 


gtctacat tg 


ccaaccacag 


gcctctcttc ' 


2220 


40 


acacacttgg 


eggtgaggat 


eggagggege 


tatgtcgtgg 


ctgggaagat 


gagcatctcc 


2280 




cctaacacca 


cctacccctc 


cctcctggag 


gatggtcgtg 


tcgagtacag 


agtggccctc 


2340 




accgaggacc 


ggctgccccg 


cctggaggag 


atccgcatct 


ggggacccct 


ccaggaagat 


2400 


45 


gctgacatcc 


aggtt tacag 


gcggtatggc 


gaggagtatg 


gcaacctcac 


ccgcccagac 


2460 




atcacc t tea 


cctact tcca 


gcctaagcca 


cggcaggcc t 


gggtgtgggc 


cgctgtgcgt 


2520 


50 


gggccctgct 


cggtgagctg 


tggggcaggg 


ctgcgctggg 


taaactacag 


ctgcctggac 


2580 


caggecagga 


aggagttggt 


ggagactgtc 


cagtgccaag 


ggagecagea 


gccaccagcg 


2640 




tggecagagg 


cctgcgtgct 


cgaaccctgc 


cctccctact 


gggcggtggg 


agacttegge 


2700 


55 


ccatgcagcg 


cctcctgtgg 


gggcggcctg 


egggagegge 


cagtgcgctg 


cgtggaggcc 


2760 
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cagggcagcc 


tcctgaagac 


attgccccca 


gcccggtgca 


gagcaggggc 


ccagcagcca 


2820 


5 


gctgtggcgc 


tggaaacctg 


caacccccag 


ccctgccctg 


ccaggtggga 


ggtgtcagag 


2880 




cccagctcat 


gcacatcagc 


tggtggagca 


ggcctggcct 


tggagaacga 


gacctgtgtg 


2940 


10 


ccaggggcag 


atggcctgga 


ggctccagtg 


actgaggggc 


ctggctccgt 


agatgagaag 


3000 


ctgcctgccc 


ctgagccctg 


tgtcgggatg 


tcatgtcctc 


caggctgggg 


ccatctggat 


3060 




gccacctctg 


caggggagaa 


ggctccctcc 


ccatggggca 


gcatcaggac 


gggggctcaa 


3120 


15 


gctgcacacg 


tgtggacccc 


tgcggcaggg 


tcgtgctccg 


tctcctgcgg 


gcgaggtctg 


3180 




atggagctgc 


gtttcctgtg 


catggactct 


gccctcaggg 


tgcctgtcca 


ggaagagctg 


3240 




tgtggcctgg 


caagcaagcc 


tgggagccgg 


cgggaggtct 


gccaggctgt 


cccgtgccct 


3300 


20 


gctcggtggc 


agtacaagct 


ggcggcctgc 


agcgtgagct 


gtgggagagg 


ggtcgtgcgg 


3360 




aggatcctgt 


attgtgcccg 


ggcccatggg 


gaggacgatg 


gtgaggagat 


cctgttggac 


3420 


25 


acccagtgcc 


aggggctgcc 


tcgcccggaa 


ccccaggagg 


cctgcagcct 


ggagccctgc 


3480 




ccacctaggt 


ggaaagtcat 


gtcccttggc 


ccatgttcgg 


ccagctgtgg 


ccttggcact 


3540 




gctagacgct 


cggtggcctg 


tgtgcagctc 


gaccaaggcc 


aggacgtgga 


ggtggacgag 


3600 


30 


gcggcctgtg 


cggcgctggt 


gcggcccgag 


gccagtgtcc 


cctgtctcat 


tgccgactgc 


3660 




acctaccgct 


ggcatgttgg 


cacctggatg 


gagtgctctg 


tt tcctgtgg 


ggatggcatc 


3720 


35 


cagcgccggc 


gtgacacctg 


cctcggaccc 


caggcccagg 


cgcctgtgcc 


agctgatttc 


3780 


tgccagcact 


tgcccaagcc 


ggtgactgtg 


cgtggctgct 


gggctgggcc 


ctgtgtggga 


3840 




cagggtacgc 


ccagcctggt 


gccccacgaa 


gaagccgctg 


ctccaggacg 


gaccacagcc 


3900 


40 


acccctgctg 


gtgcctccct 


ggagtggtcc 


caggcccggg 


gcctgctctt 


ctccccggct 


3960 




ccccagcctc 


ggcggctcct 


gcccgggccc 


caggaaaact 


cagtgcagtc 


cagtgcctgt 


4020 




ggcaggcagc 


accttgagcc 


aacaggaacc 


attgacatgc 


gaggcccagg 


gcaggcagac 


4080 


45 


tgtgcagtgg 


ccattgggcg 


gcccctcggg 


gaggtggtga 


ccctccgcgt 


ccttgagagt 


4140 




tctctcaact 


gcagtgcggg 


ggacatgttg 


ctgctttggg 


gccggctcac 


ctggaggaag 


4200 


50 


atgtgcagga 


agctgttgga 


catgactttc 


agctccaaga 


ccaacacgct 


ggtggtgagg 


4260 




cagcgctgcg 


ggcggccagg 


aggtggggtg 


ctgctgcggt 


atgggagcca 


get tgetect 


4320 




gaaaccttct 


acagagaatg 


tgacatgcag 


ctctttgggc 


cctggggtga 


aategtgagc 


4380 


55 


ccctcgctga 


gtccagccac 


gagtaatgca 


gggggctgcc 


ggctct teat 


taatgtggct 


4440 
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ccgcacgcac ggattgccat ccatgccctg gccaccaaca tgggcgctgg gaccgaggga 4500 
5 gccaatgcca gctacatctt gatccgggac acccacagct tgaggaccac agcgttccat 4560 
gggcagcagg tgctctactg ggagtcagag agcagccagg ctgagatgga gttcagcgag 4620 
ggcttcctga aggctcaggc cagcctgcgg ggccagtact ggaccctcca atcatgggta 4680 
ccggagatgc aggaccctca gtcctggaag ggaaaggaag gaacctgagg gtcattgaac 4740 
atttgttccg tgtctggcca gccctggagg gttgacccct ggtctcagtg ctttccaatt . 4800 
15 cgaacttttt ccaatcttag gtatctactt tagagtcttc tccaatgtcc aaaaggctag 4860 
ggggttggag gtggggactc tggaaaagca gcccccattt cctcgggtac caataaataa 4920 
aacatgcagg ccaaaaaaaa aaaaaaaaaa 4950 

20 

<210>16 
<211>1353 

25 

<212>PRT 

<213> Homo sapiens 
<400>16 

get gca ggc ggc ate eta cac ctg gag ctg ctg gtg gee gtg ggc 45 
Ala Ala Gly Gly lie Leu His Leu Glu Leu Leu Val Ala Yal Gly 
15 10 15 

ccc gat gtc ttc cag get cac cag gag gac aca gag cgc tat gtg 90 

40 Pro Asp Val Phe Gin Ala His Gin Glu Asp Thr Glu Arg Tyr Val 

20 25 30 

etc acc aac etc aac ate ggg gca gaa ctg ctt egg gac ceg tec 135 

45 Leu Thr Asn Leu Asn He Gly Ala Glu Leu Leu Arg Asp Pro Ser 

35 40 45 

ctg ggg get cag ttt egg gtg cac ctg gtg aag atg gtc att ctg 180 

50 

Leu Gly Ala Gin Phe Arg Val His Leu Val Lys Met Val He Leu 
50 55 60 

55 aca gag cct gag ggt get cca aat ate aca gec aac etc acc teg 225 
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Thr Glu Pro Glu Gly Ala Pro Asn lie Thr Ala Asn Leu Thr Ser 

65 70 75 

tec ctg ctg age gtc tgt ggg tgg age cag ace ate aac cct gag 
Ser Leu Leu Ser Val Cys Gly Trp Ser Gin Thr He Asn Pro Glu 

80 85 90 

gac gac acg gat cct ggc cat get gac ctg gtc etc tat ate act 
Asp Asp Thr Asp Pro Gly His Ala Asp Leu Val Leu Tyr He Thr 

95 100 105 

agg ttt gac ctg gag ttg cct gat ggt aac egg cag gtg egg ggc 
Arg Phe Asp Leu Glu Leu Pro Asp Gly Asn Arg Gin Val Arg Gly 

110 115 120 

gtc acc cag ctg ggc ggt gee tgc tec cca acc tgg age tgc etc 
Val Thr Gin Leu Gly Gly Ala Cys Ser Pro Thr Trp Ser Cys Leu 

125 130 135 

att acc gag gac act ggc ttc gac ctg gga gtc acc att gee cat 
He Thr Glu Asp Thr Gly Phe Asp Leu Gly Val Thr He Ala His 

140 145 150 

gag att ggg cac age ttc ggc ctg gag cac gac ggc gcg ccc ggc 
Glu He Gly His Ser Phe Gly Leu Glu His Asp Gly Ala Pro Gly 

155 160 165 

age ggc tgc ggc ccc age gga cac gtg atg get teg gac ggc gec 
Ser Gly Cys Gly Pro Ser Gly His Val Met Ala Ser Asp Gly Ala 

170 175 180 

gcg ccc cgc gec ggc etc gec tgg tec ccc tgc age cgc egg cag 
Ala Pro Arg Ala Gly Leu Ala Trp Ser Pro Cys Ser Arg Arg Gin 

185 190 195 

ctg ctg age ctg etc age gca gga egg gcg cgc tgc gtg tgg gac 
Leu Leu Ser Leu Leu Ser Ala Gly Arg Ala Arg Cys Val Trp Asp 
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ccg ccg 
Pro Pro 

cag cct 
Gin Pro 

ttc ggc 
Phe Gly 

gat atg 
Asp Met 

age age 
Ser Ser 

tgt ggc 
Cys Gly 

gtg gag 
Yal Glu 

tgg ggt 
Trp Gly 

gtc ace 
Val Thr 



200 

egg cct caa 
Arg Pro Gin 
215 

ggc etc tac 
Gly Leu Tyr 
230 

ccc aag get 
Pro Lys Ala 
245 

tgc cag gec 
Cys Gin Ala 
260 

tgc age cgc 
Cys Ser Arg 
275 

gtg gag aag 
Val Glu Lys 
290 

ctg acc ccc 
Leu Thr Pro 
305 

ccc cga agt 
Pro Arg Ser 
320 

agg agg egg 
Arg Arg Arg 
335 



ccc ggg tec 
Pro Gly Ser 

tac age gee 
Tyr Ser Ala 

gtc gec tgc 
Val Ala Cys 

etc tec tgc 
Leu Ser Cys 

etc etc gtt 
Leu Leu Val 

tgg tgc tec 
Trp Cys Ser 

ata gca gca 
lie Ala Ala 

cct tgc tec 
Pro Cys Ser 

cag tgc aac 
Gin Cys Asn 



205 
gcg ggg 
Ala Gly 

220 
aac gag 
Asn Glu 

235 
acc ttc 
Thr Phe 

250 
cac aca 
His Thr 

265 
cct etc 
Pro Leu 

280 
aag ggt 
Lys Gly 

295 
gtg cat 
Val His 

310 
cgc tec 
Arg Ser 

325 
aac ccc 
Asn Pro 

340 



cac ccg 
His Pro 

cag tgc 
Gin Cys 

gee agg 
Ala Arg 

gac ccg 
Asp Pro 

ctg gat 
Leu Asp 

cgc tgc 
Arg Cys 

ggg cgc 
Gly Arg 

tgc gga 
Cys Gly 

aga cct 
Arg Pro 



210 

ccg gat gcg 
Pro Asp Ala 
225 

cgc gtg gee 
Arg Val Ala 
240 

gag cac ctg 
Glu His Leu 
255 

ctg gac caa 
Leu Asp Gin 
270 

ggg aca gaa 
Gly Thr Glu 
285 

cgc tec ctg 
Arg Ser Leu 
300 

tgg tct age 
Trp Ser Ser 
315 

gga ggt gtg 
Gly Gly Val 
330 

gee ttt ggg 
Ala Phe Gly 
345 
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ggg Cgt 

Gly Arg 

act cag 
Thr Gin 

tgc gcc 
Cys Ala 

gcc tec 
Ala Ser 

gat get 
Asp Ala 

ate atg 
lie Met 

cca agt 
Pro Ser 

ggc age 
Gly Ser 

cag gta 
Gin Val 



gca tgt 
Ala Cys 

gcc tgc 
Ala Cys 

agg acc 
Arg Thr 

ttc tac 
Phe Tyr 

ctg tgc 
Leu Cys 

aag cgt 
Lys Arg 

ggc ccc 
Gly Pro 

tgc agg 
Cys Arg 

tgg gac 
Trp Asp 



gtt ggt 
Val Gly 
350 

gag aag 
Glu Lys 
365 

gac ggc 
Asp Gly 
380 

cac tgg 
His Trp 



tgc age cca egg 



aga cac 
Arg His 
410 

gga gac 
Gly Asp 
425 

egg gag 
Arg Glu 
440 

aca ttt 
Thr Phe 
455 

agg tgc 
Arg Cys 
470 

aag ggc 



get gac etc 
Ala Asp Leu 

acc cag ctg 
Thr Gin Leu 

cag ccg ctg 
Gin Pro Leu 

ggt get get 
Gly Ala Ala 

atg tgc egg 
Met Cys Arg 

age ttc etc 
Ser Phe Leu 

gac ggg acc 
Asp Gly Thr 

ggc tgt gat 
Gly Cys Asp 

cag gtg tgt 
Gin Val Cys 

tct ttc aca 



cag gcc gag 
Gin Ala Glu 
355 

gag ttc atg 
Glu Phe Met 
370 

cgc tec tec 
Arg Ser Ser 
385 

gta cca cac 
Val Pro His 
400 

gcc att ggc 
Ala lie Gly 
415 

gat ggg acc 
Asp Gly Thr 
430 

ctg age ctg 
Leu Ser Leu 
445 

ggt agg atg 
Gly Arg Met 
460 

ggt ggg gac 
Gly Gly Asp 
475 

get ggc aga 



atg tgc aac 
Met Cys Asn 
360 

teg caa cag 
Ser Gin Gin 
375 

cct ggc ggc 
Pro Gly Gly 
390 

age caa ggg 
Ser Gin Gly 
405 

gag age ttc 
Glu Ser Phe 
420 

egg tgt atg 
Arg Cys Met 
435 

tgt gtg teg 
Cys Val Ser 
450 

gac tec cag 
Asp Ser Gin 
465 

aac age acg 
Asn Ser Thr 
480 

gcg aga gaa 
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Cys Ser Pro Arg Lys Gly Ser Phe Thr Ala Gly Arg Ala Arg Glu 

485 490 495 

tat gtc acg ttt ctg aca gtt acc ccc aac ctg acc agt gtc tac 
Tyr Val Thr Phe Leu Thr Val Thr Pro Asn Leu Thr Ser Val Tyr 

500 505 510 

att gcc aac cac agg cct etc ttc aca cac ttg gcg gtg agg ate 
He Ala Asn His Arg Pro Leu Phe Thr His Leu Ala Val Arg He 

515 520 525 

gga ggg cgc tat gtc gtg get ggg aag atg age ate tec cct aac 
Gly Gly Arg Tyr Val Val Ala Gly Lys Met Ser He Ser Pro Asn 

530 535 540. 

acc acc tac ccc tec etc ctg gag gat ggt cgt gtc gag tac aga 
Thr Thr Tyr Pro Ser Leu Leu Glu Asp Gly Arg Val Glu Tyr Arg 

545 550 . 555 

gtg gcc etc acc gag gac egg ctg ccc cgc ctg gag gag ate cgc 
Val Ala Leu Thr Glu Asp Arg Leu Pro Arg Leu Glu Glu He Arg 

560 565 570 

ate tgg gga ccc etc cag gaa gat get gac ate cag gtt tac agg 
He Trp Gly Pro Leu Gin Glu Asp Ala Asp lie Gin Val Tyr Arg 

575 580 585 

egg tat ggc gag gag tat ggc aac etc acc cgc cca gac ate acc 
Arg Tyr Gly Glu Glu Tyr Gly Asn Leu Thr Arg Pro Asp lie Thr 

590 595 600 

ttc acc tac ttc cag cct aag cca egg cag gcc tgg gtg tgg gcc 
Phe Thr Tyr Phe Gin Pro Lys Pro Arg Gin Ala Trp Val Trp Ala 

605 610 615 

get gtg cgt ggg ccc tgc teg gtg age tgt ggg gca ggg ctg cgc 
Ala Val Arg Gly Pro Cys Ser Val Ser Cys Gly Ala Gly Leu Arg 
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620 








625 


630 




5 


tgg 


gta 


aac 


tac 


age 


tgc 


ctg 


gac cag 


gec agg aag 


gag ttg gtg 


1935 




Trp 


Val 


Asn 


Tyr 


Ser 


Cys 


Leu 


Asp Gin 


Ala Arg Lys 


Glu Leu Val 














635 








640 


645 




10 


gag 


act 


gtc 


cag 


tgc 


caa 


ggg 


age cag 


cag cca cca 


gcg tgg cca 


1980 




Glu 


Thr 


Val 


Gin 


Cys 


Gin 


Gly 


Ser Gin 


Gin Pro Pro 


Ala Trp Pro 




15 










650 








655 


660 






gag 


gec 


tgc 


gtg 


etc 


gaa 


ccc 


tgc cct 


ccc tac tgg 


gcg gtg gga 


2025 




Glu 


Ala 


Cys 


Val 


Leu 


Glu 


Pro 


Cys Pro 


Pro Tyr Trp 


Ala Val Gly 




20 










665 








670 


675 






gac 


ttc 


ggc 


cca 


tgc 


age 


gec 


tec tgt 


ggg ggc ggc 


ctg egg gag 


2070 


25 


Asp 


Phe 


Gly 


Pro 


Cys 


Ser 


Ala 


Ser Cys 


Gly Gly Gly 


Leu Arg Glu 












680 








685 


690 






egg 


cca 


gtg 


cgc 


tgc 


gtg 


gag 


gec cag 


ggc age etc 


ctg aag aca 


2115 


30 


Arg 


Pro 


Val 


Arg 


Cys 


Val 


Glu 


Ala Gin 


Gly Ser Leu 


Leu Lys Thr 














695 








700 


705 






ttg 


ccc 


cca 


gec 


egg 


tgc 


aga 


gca ggg 


gec cag cag 


cca get gtg 


2160 


35 


Leu 


Pro 


Pro 


Ala 


Arg 


Cys 


Arg 


Ala Gly 


Ala Gin Gin 


Pro Ala Val 














710 








715 


720 




40 


gcg 


ctg 


gaa 


acc 


tgc 


aac 


ccc 


cag ccc 


tgc cct gee 


agg tgg gag 


2205 




Ala 


Leu 


Glu 


Thr 


Cys 


Asn 


Pro 


Gin Pro 


Cys Pro Ala 


Arg Trp Glu 














725 








730 


735 




45 


gtg 


tea 


gag 


ccc 


age 


tea 


tgc 


aca tea 


get ggt gga 


gca ggc ctg 


2250 




Val 


Ser 


Glu 


Pro 


Ser 


Ser 


Cys 


Thr Ser 


Ala Gly Gly 


Ala Gly Leu 




50 










740 








745 


750 




gec 


ttg 


gag 


aac 


gag 


acc 


tgt 


gtg cca 


ggg gca gat 


ggc ctg gag 


2295 




Ala 


Leu 


Glu 


Asn 


Glu 


Thr 


Cys 


Val Pro 


Gly Ala Asp 


Gly Leu Glu 




55 










755 








760 


765 
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get cca gtg act 
Ala Pro Val Thr 

gec cct gag ccc 
Ala Pro Glu Pro 

cat ctg gat gec 
His Leu Asp Ala 

ggc age ate agg 
Gly Ser He Arg 

gcg gca ggg teg 
Ala Ala Gly Ser 

ctg cgt ttc ctg 
Leu Arg Phe Leu 

gaa gag ctg tgt 
Glu Glu Leu Cys 

gtc tgc cag get 
Val Cys Gin Ala 

gcg gec tgc age 
Ala Ala Cys Ser 

ctg tat tgt gee 



gag ggg 
Glu Gly 
770 

tgt gtc 
Cys Val 
785 

acc tct 
Thr Ser 
800 

acg ggg 
Thr Gly 
815 

tgc tec 
Cys Ser 
830 

tgc atg 
Cys Met 
845 

ggc ctg 
Gly Leu 
860 

gtc ccg 
Val Pro 
875 

gtg age 
Val Ser 
890 

egg gec 



cct ggc tec 
Pro Gly Ser 

ggg atg tea 
Gly Met Ser 

gca ggg gag 
Ala Gly Glu 

get caa get 
Ala Gin Ala 

gtc tec tgc 
Val Ser Cys 

gac tct gee 
Asp Ser Ala 

gca age aag 
Ala Ser Lys 

tgc cct get 
Cys Pro Ala 

tgt ggg aga 
Cys Gly Arg 

cat ggg gag 



gta gat 
Val Asp 
775 

tgt cct 
Cys Pro 
790 

aag get 
Lys Ala 
805 

gca cac 
Ala His 

820 

ggg cga 
Gly Arg 
835 

etc agg 
Leu Arg 
850 

cct ggg 
Pro Gly 
865 

egg tgg 
Arg Trp 
880 

ggg gtc 
Gly Val 
895 

gac gat 



gag aag 
Glu Lys 

cca ggc 
Pro Gly 

ccc tec 
Pro Ser 

gtg tgg 
Val Trp 

ggt ctg 
Gly Leu 

gtg cct 
Val Pro 

age egg 
Ser Arg 

cag tac 
Gin Tyr 

gtg egg 
Val Arg 

ggt gag 



ctg cct 
Leu Pro 

780 
tgg ggc 
Trp Gly 

795 
cca tgg 
Pro Trp 

810 
acc cct 
Thr Pro 

825 
atg gag 
Met Glu 

840 
gtc cag 
Val Gin 

855 
egg gag 
Arg Glu 

870 
aag ctg 
Lys Leu 

885 
agg ate 
Arg He 

900 
gag ate 
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Leu Tyr Cys Ala 

ctg ttg gac acc 
Leu Leu Asp Thr 

gag gcc tgc age 
Glu Ala Cys Ser 

tec ctt ggc cca 
Ser Leu Gly Pro 

cgc icg gtg gcc 
Arg Ser Val Ala 

gtg gac gag gcg 
Val Asp Glu Ala 

gtc ccc tgt etc 
Val Pro Cys Leu 

acc tgg atg gag 
Thr Trp Met Glu 

egg cgt gac acc 
Arg Arg Asp Thr 

get gat ttc tgc 
Ala Asp Phe Cys 



Arg Ala 
905 

cag tgc 
Gin Cys 
920 

ctg gag 
Leu Glu 
935 

tgt teg 
Cys Ser 
950 

tgt gtg 
Cys Val 
965 

gcc tgt 
Ala Cys 
980 

att gcc 
He Ala 
995 

tgc tct 
Cys Ser 
1010 
tgc etc 
Cys Leu 
1025 
cag cac 
Gin His 



His Gly Glu 

cag ggg ctg 
Gin Gly Leu 

ccc tgc cca 
Pro Cys Pro 

gcc age tgt 
Ala Ser Cys 

cag etc gac 
Gin Leu Asp 

gcg gcg ctg 
Ala Ala Leu 

gac tgc acc 
Asp Cys Thr 

gtt tec tgt 
Val Ser Cys 

gga ccc cag 
Gly Pro Gin 

ttg ccc aag 
Leu Pro Lys 



Asp Asp 
910 

cct cgc 
Pro Arg 
925 

cct agg 
Pro Arg 
940 

ggc ctt 
Gly Leu 
955 

caa ggc 
Gin Gly 
970 

gtg egg 
Val Arg 
985 

tac cgc 
Tyr Arg 
1000 
ggg gat 
Gly Asp 
1015 
gcc cag 
Ala Gin 
1030 
ccg gtg 
Pro Val 



Gly Glu 

ccg gaa 
Pro Glu 

tgg aaa 
Trp Lys 

ggc act 
Gly Thr 

cag gac 
Gin Asp 

ccc gag 
Pro Glu 

tgg cat 
Trp His 

ggc ate 
Gly He 

gcg cct 
Ala Pro 

act gtg 
Thr Val 



Glu He 

915 
ccc cag 
Pro Gin 

930 
gtc atg 
Val Met 

945 
get aga 
Ala Arg 

960 
gtg gag 
Val Glu 

975 
gcc agt 
Ala Ser 

990 
gtt ggc 
Val Gly 

1005 
cag cgc 
Gin Arg 

1020 
gtg cca 
Val Pro 

1035 
cgt ggc 
Arg Gly 
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1040 1045 1050 

tgc tgg get ggg ccc tgt gtg gga cag ggt acg ccc age ctg gtg 

Cys Trp Ala Gly Pro Cys Val Gly Gin Gly Thr Pro Ser Leu Val 

1055 1060 1065 

ccc cac gaa gaa gee get get cca gga egg ace aca gee acc cct 

Pro His Glu Glu Ala Ala Ala Pro Gly Arg Thr Thr Ala Thr Pro 

1070 1075 1080 

get ggt gee tec ctg gag tgg tec cag gee egg ggc ctg etc ttc 

Ala Gly Ala Ser Leu Glu Trp Ser Gin Ala Arg Gly Leu Leu Phe 

1085 1090 1095 

tec ccg get ccc cag cct egg egg etc ctg ccc ggg ccc cag gaa 

Ser Pro Ala Pro Gin Pro Arg Arg Leu Leu Pro Gly Pro Gin Giu 

1100 1105 1110 

aac tea gtg cag tec agt gee tgt ggc agg cag cac ctt gag cca 

.Asn Ser Val Gin Ser Ser Ala Cys Gly Arg Gin His Leu Glu Pro 

1115 1120 1125 

aca gga acc att gac atg cga ggc cca ggg cag gca gac tgt gca 

Thr Gly Thr He Asp Met Arg Gly Pro Gly Gin Ala Asp Cys Ala 

1130 1135 1140 

gtg gec att ggg egg ccc etc ggg gag gtg gtg acc etc cgc gtc 

Val Ala He Gly Arg Pro Leu Gly Glu Val Val Thr Leu Arg Val 

1145 1150 1155 

ctt gag agt tct etc aac tgc agt gcg ggg gac atg ttg ctg ctt 

Leu Glu Ser Ser Leu Asn Cys Ser Ala Gly Asp Met Leu Leu Leu 

1160 1165 1170 

tgg ggc egg etc acc tgg agg aag atg tgc agg aag ctg ttg gac 

Trp Gly Arg Leu Thr Trp Arg Lys Met Cys Arg Lys Leu Leu Asp 

1175 1180 1185 
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atg act ttc age tec aag ace aac acg ctg gtg gtg agg cag cgc 

Met Thr Phe Ser Ser Lys Thr Asn Thr Leu Val Val Arg Gin Arg 

l 190 1195 1200 

tgc ggg egg cca gga ggt ggg gtg ctg ctg egg tat ggg age cag 

Cys Gly Arg Pro Gly Gly Gly Val Leu Leu Arg Tyr Gly Ser Gin 

1205 1210 1215 

ctt get cct gaa ace ttc tac aga gaa tgt gac atg cag etc ttt 

Leu Ala Pro Glu Thr Phe Tyr Arg Glu Cys Asp Met Gin Leu Phe 

1220 1225 1230 

ggg ccc tgg ggt gaa ate gtg age ccc teg ctg agt cca gee acg 

Gly Pro Trp Gly Glu He Val Ser Pro Ser Leu Ser Pro Ala Thr 

1235 1240 1245 

agt aat gca ggg ggc tgc egg etc ttc att aat gtg get ccg cac 

Ser Asn Ala Gly Gly Cys Arg Leu Phe He Asn Val Ala Pro His 

1250 1255 1260 

gca egg att gec ate cat gec ctg gec acc aac atg ggc get ggg 

Ala Arg lie Ala He His Ala Leu Ala Thr Asn Met Gly Ala Gly 

1265 1270 1275 

acc gag gga gec aat gec age tac ate ttg ate egg gac acc cac 

Thr Glu Gly Ala Asn Ala Ser Tyr He Leu He Arg Asp Thr His 

1280 1285 1290 

age ttg agg acc aca gcg ttc cat ggg cag cag gtg etc tac tgg 

Ser Leu Arg Thr Thr Ala Phe His Gly Gin Gin Val Leu Tyr Trp 

1295 1300 1305 

gag tea gag age age cag get gag atg gag ttc age gag ggc ttc 

Glu Ser Glu Ser Ser Gin Ala Glu Met Glu Phe Ser Glu Gly Phe 

1310 1315 1320 

ctg aag get cag gee age ctg egg ggc cag tac tgg acc etc caa 
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Leu Lys Ala Gin Ala Ser Leu Arg Gly Gin Tyr Trp Thr Leu Gin 

1325 1330 1335 

tea tgg gta ccg gag atg cag gac cct cag tec tgg aag gga aag 

Ser Trp Val Pro Glu Met Gin Asp Pro Gin Ser Trp Lys Gly Lys 

1340 1345 1350 

gaa gga acc 
Glu Gly Thr 

<210>17 

<211>1297 

<212>PRT 

<213> Homo sapiens 
<400>17 

get gpa.ggc. ggc ate. eta cac ctg gag ctg. ctg gtg gec gtg ggc 

Ala Ala Gly Gly He Leu His Leu Glu Leu Leu Val Ala Val Gly 
15 10 15 

ccc gat gtc ttc cag get cac cag gag gac aca gag cgc tat gtg 

Pro Asp Val Phe Gin Ala His Gin Glu Asp Thr Glu Arg Tyr Val 

20 25 30 

etc acc aac etc aac ate ggg gca gaa ctg ctt egg gac ccg tec 

Leu Thr Asn Leu Asn He Gly Ala Glu Leu Leu Arg Asp Pro Ser 

35 40 45 

ctg ggg get cag ttt egg gtg cac ctg gtg aag atg gtc att ctg 

Leu Gly Ala Gin Phe Arg Val His Leu Val Lys Met Val He Leu 

50 55 60 

aca gag cct gag ggt get cca aat ate aca gee aac etc acc teg 

Thr Glu Pro Glu Gly Ala Pro Asn He Thr Ala Asn Leu Thr Ser 

65 70 75 
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tec ctg ctg age gtc tgt ggg tgg age cag ace ate aac cct gag 270 
Ser Leu Leu Ser Val Cys Gly Trp Ser Gin Thr He Asn Pro Glu 

80 85 90 

gac gac acg gat cct ggc cat get gac ctg gtc etc tat ate act 315 
Asp Asp Thr Asp Pro Gly His Ala Asp Leu Val Leu Tyr He Thr 

95 100 105 

agg ttt gac ctg gag ttg cct gat ggt aac egg cag gtg egg ggc 360 
Arg Phe Asp Leu Glu Leu Pro Asp Gly Asn Arg Gin Val Arg Gly 

110 115 120 

gtc acc cag ctg ggc ggt gec tgc tec cca acc tgg age tgc etc 405 
Val Thr Gin Leu Gly Gly Ala Cys Ser Pro Thr Trp Ser Cys Leu 

125 130 135 

att acc gag gac act ggc ttc gac ctg gga gtc acc att gec cat 450 
lie Thr Glu Asp Thr Gly Phe Asp Leu Gly Val Thr He- Ala His 

140 145 150 

gag att ggg cac age ttc ggc ctg gag cac gac ggc gcg ccc ggc 495 
Glu lie Gly His Ser Phe Gly Leu Glu His Asp Gly Ala Pro Gly 

155 160 165 

age ggc tgc ggc ccc age gga cac gtg atg get teg gac ggc gec 540 
Ser Gly Cys Gly Pro Ser Gly His Val Met Ala Ser Asp Gly Ala 

170 175 180 

gcg ccc cgc gec ggc etc gec tgg tec ccc tgc age cgc egg cag 585 
Ala Pro Arg Ala Gly Leu Ala Trp Ser Pro Cys Ser Arg Arg Gin 

185 190 195 

ctg ctg age ctg etc age gca gga egg gcg cgc tgc gtg tgg gac 630 
Leu Leu Ser Leu Leu Ser Ala Gly Arg Ala Arg Cys Val Trp Asp 

200 205 210 

ccg ccg egg cct caa ccc ggg tec gcg ggg cac ccg ccg gat gcg 675 
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Pro Pro Arg Pro Gin Pro Gly Ser Ala Gly His Pro Pro Asp Ala 

215 220 225 

cag cct ggc etc tac tac age gec aac gag cag tgc cgc gtg gec 
Gin Pro Gly Leu Tyr Tyr Ser Ala Asn Glu Gin Cys Arg Val Ala 

230 235 240 

ttc ggc ccc aag get gtc gec tgc acc ttc gec agg gag cac ctg 
Phe Gly Pro Lys Ala Val Ala Cys Thr Phe Ala Arg Glu His Leu 

245 250 255 

gat atg tgc cag gee etc tec tgc cac aca gac ccg ctg gac caa 
Asp Met Cys Gin Ala Leu Ser Cys His Thr Asp Pro Leu Asp Gin 

260 265 270 

age age tgc age cgc etc etc gtt cct etc ctg gat ggg aca gaa 
Ser Ser Cys Ser Arg Leu Leu Yal Pro Leu Leu Asp Gly Thr Glu 

275 280 285 

tgt ggc gtg gag aag tgg tgc tec aag ggt cgc. tgc cgc tec ctg 
Cys Gly Val Glu Lys Trp Cys Ser Lys Gly Arg Cys Arg Ser Leu 

290 295 300 

gtg gag ctg acc ccc ata gca gca gtg cat ggg cgc tgg tct age 
Val Glu Leu Thr Pro He Ala Ala Val His Gly Arg Trp Ser Ser 

305 310 315 

tgg ggt ccc cga agt cct tgc tec cgc tec tgc gga gga ggt gtg 
Trp Gly Pro Arg Ser Pro Cys Ser Arg Ser Cys Gly Gly Gly Val 

320 325 330 

gtc acc agg agg egg cag tgc aac aac ccc aga cct gee ttt ggg 
Val Thr Arg Arg Arg Gin Cys Asn Asn Pro Arg Pro Ala Phe Gly 

335 340 345 

ggg cgt gca tgt gtt ggt get gac etc cag gee gag atg tgc aac 
Gly Arg Ala Cys Val Gly Ala Asp Leu Gin Ala Glu Met Cys Asn 
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act cag gcc 
Thr Gin Ala 

tgc gcc agg 
Cys Ala Arg 

gcc tec ttc 
Ala Ser Phe 

gat get ctg 
Asp Ala Leu 

ate atg aag 
He Met Lys 

cca agt ggc 
Pro Ser Gly 

ggc age tgc 
Gly Ser Cys 

cag gta tgg 
Gin Val Trp 

tgc age cca 
Cys Ser Pro 



350 
tgc gag aag 
Cys Glu Lys 

365 

ace gac ggc 
Thr Asp Gly 

380 
tac cac tgg 
Tyr His Trp 

395 
tgc aga cac 
Cys Arg His 

410 
cgt gga gac 
Arg Gly Asp 

425 
ccc egg gag 
Pro Arg Glu 

440 
agg aca ttt 
Arg Thr Phe 

455 
gac agg tgc 
Asp Arg Cys 

470 
egg aag ggc 
Arg Lys Gly 

485 



ace cag ctg 
Thr Gin Leu 

cag ccg ctg 
Gin Pro Leu 

ggt get get 
Gly Ala Ala 

atg tgc egg 
Met Cys Arg 

age ttc etc 
Ser Phe Leu 

gac ggg ace 
Asp Gly Thr 

ggc tgt gat 
Gly Cys Asp 

cag gtg tgt 
Gin Val Cys 

tct ttc aca 
Ser Phe Thr 



355 

gag ttc 
Glu Phe 
370 

cgc tec 
Arg Ser 
385 

gta cca 
Val Pro 
400 

gcc att 
Ala He 
415 

gat ggg 
Asp Gly 
430 

ctg age 
Leu Ser 
445 

ggt agg 
Gly Arg 
460 

ggt ggg 
Gly Gly 
475 

get ggc 
Ala Gly 
490 



atg teg 
Met Ser 

tec cct 
Ser Pro 

cac age 
His Ser 

ggc gag 
Gly Glu 

ace egg 
Thr Arg 

ctg tgt 
Leu Cys 

atg gac 
Met Asp 

gac aac 
Asp Asn 

aga gcg 
Arg Ala 



360 
caa cag 
Gin Gin 

375 
ggc ggc 
Gly Gly 

390 
caa ggg 
Gin Gly 

405 
age ttc 
Ser Phe 

420 
tgt atg 
Cys Met 

435 
gtg teg 
Val Ser 

450 
tec cag 
Ser Gin 

465 
age acg 
Ser Thr 

480 
aga gaa 
Arg Glu 

495 
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tat gtc acg ttt 
Tyr Val Thr Phe 

att gcc aac cac 
He Ala Asn His 

gga ggg cgc tat 
Gly Gly Arg Tyr 

acc acc tac ccc 
Thr Thr Tyr Pro 

gtg gcc etc acc 
Val Ala Leu Thr 

ate tgg gga ccc 
He Trp Gly Pro 

egg tat ggc gag 
Arg Tyr Gly Glu 

ttc acc tac ttc 
Phe Thr Tyr Phe 

get gtg cgt ggg 
Ala Val Arg Gly 

tgg gta aac tac 



ctg aca 
Leu Thr 
500 

agg cct 
Arg Pro 
515 

gtc gtg 
Val Val 
530 

tec etc 
Ser Leu 
545 

gag gac 
Glu Asp 
560 

etc cag 
Leu Gin 
575 

gag tat 
Glu Tyr 
590 

cag cct 
Gin Pro 
605 

ccc tgc 
Pro Cys 
620 

age tgc 



gtt acc 
Val Thr 

etc ttc 
Leu Phe 

get ggg 
Ala Gly 

ctg gag 
Leu Glu 

egg ctg 
Arg Leu 

gaa gat 
Glu Asp 

ggc aac 
Gly Asn 

aag cca 
Lys Pro 

teg gtg 
Ser Val 

ctg gac 



ccc aac 
Pro Asn 

505 
aca cac 
Thr His 

520 
aag atg 
Lys Met 

535 
gat ggt 
Asp Gly 

550 
ccc cgc 
Pro Arg 

565 
get gac 
Ala Asp 

580 
etc acc 
Leu Thr 

595 
egg cag 
Arg Gin 

610 
age tgt 
Ser Cys 

625 
cag gcc 



ctg acc 
Leu Thr 

ttg gcg 
Leu Ala 

age ate 
Ser He 

cgt gtc 
Arg Val 

ctg gag 
Leu Glu 

ate cag 
He Gin 

cgc cca 
Arg Pro 

gcc tgg 
Ala Trp 

ggg gca 
Gly Ala 

agg aag 



agt gtc tac 
Ser Val Tyr 
510 

gtg agg ate 
Val Arg He 
525 

tec cct aac 
Ser Pro Asn 
540 

gag tac aga 
Glu Tyr Arg 

555 

gag ate cgc 
Glu He Arg 
570 

gtt tac agg 
Val Tyr Arg 
585 

gac ate acc 
Asp He Thr 
600 

gtg tgg gcc 
Val Trp Ala 
615 

ggg ctg cgc 
Gly Leu Arg 
630 

gag ttg gtg 
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Trp Val Asn Tyr Ser Cys Leu Asp Gin Ala Arg Lys Glu Leu Val 

635 640 645 

gag act gtc cag tgc caa ggg age cag cag cca cca gcg tgg cca 
Glu Thr Val Gin Cys Gin Gly Ser Gin Gin Pro Pro Ala Trp Pro 

650 655 660 

gag gec tgc gtg etc gaa ccc tgc cct ccc tac tgg gcg gtg gga 
Glu Ala Cys Val Leu Glu Pro Cys Pro Pro Tyr Trp Ala Val Gly 

665 670 675 

gac ttc ggc cca tgc age gec tec tgt ggg ggc ggc ctg egg gag 
Asp Phe Gly Pro Cys Ser Ala Ser Cys Gly Gly Gly Leu Arg Glu 

680 685 690 

egg cca gtg cgc tgc gtg gag gec cag ggc age etc ctg aag aca 
Arg Pro Val Arg Cys Val Glu Ala Gin Gly Ser Leu Leu Lys Thr 

695 . 700 705 

ttg ccc cca gec egg tgc aga gca ggg gec cag cag cca get gtg 
Leu Pro Pro Ala Arg Cys Arg Ala Gly Ala Gin Gin Pro Ala Val 

710 715 720 

gcg ctg gaa acc tgc aac ccc cag ccc tgc cct gec agg tgg gag 
Ala Leu Glu Thr Cys Asn Pro Gin Pro Cys Pro Ala Arg Trp Glu 

725 730 735 

gtg tea gag ccc age tea tgc aca tea get ggt gga gca ggc ctg 
Val Ser Glu Pro Ser Ser Cys Thr Ser Ala Gly Gly Ala Gly Leu 

740 745 750 

gec ttg gag aac gag acc tgt gtg cca ggg gca gat ggc ctg gag 
Ala Leu Glu Asn Glu Thr Cys Val Pro Gly Ala Asp Gly Leu Glu 

755 760 765 

get cca gtg act gag ggg cct ggc tec gta gat gag aag ctg cct 
Ala Pro Val Thr Glu Gly Pro Gly Ser Val Asp Glu Lys Leu Pro 
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gcc cct 
Ala Pro 

cat ctg 
His Leu 

ggc age 
Gly Ser 

gcg gca 
Ala Ala 

ctg cgt 
Leu Arg 

gaa gag 
Glu Glu 

gtc tgc 
Val Cys 

gcg gcc 
Ala Ala 

ctg tat 
Leu Tyr 



gag ccc 
Glu Pro 

gat gcc 
Asp Ala 

ate agg 
He Arg 

ggg teg 
Gly Ser 

ttc ctg 
Phe Leu 

ctg tgt 
Leu Cys 

cag get 
Gin Ala 

tgc age 
Cys Ser 

tgt gcc 
Cys Ala 



770 

tgt gtc 
Cys Yal 
785 

acc tct 
Thr Ser 
800 

acg ggg 
Thr Gly 
815 

tgc tec 
Cys Ser 
830 

tgc atg 
Cys Met 
845 

ggc ctg 
Gly Leu 
860 

gtc ccg 
Val Pro 
875 

gtg age 
Val Ser 
890 

egg gcc 
Arg Ala 
905 



ggg atg tea 
Gly Met Ser 

gca ggg gag 
Ala Gly Glu 

get caa get 
Ala Gin Ala 

gtc tec tgc 
Val Ser Cys 

gac tct gcc 
Asp Ser Ala 

gca age aag 
Ala Ser Lys 

tgc cct get 
Cys Pro Ala 

tgt ggg aga 
Cys Gly Arg 

cat ggg gag 
His Gly Glu 



775 

tgt cct cca 
Cys Pro Pro 
790 

aag get ccc 
Lys Ala Pro 
805 

gca cac gtg 
Ala His Val 
820 

ggg cga ggt 
Gly Arg Gly 
835 

etc agg gtg 
Leu Arg Val 
850 

cct ggg age 
Pro Gly Ser 
865 

egg tgg cag 
Arg Trp Gin 
880 

ggg gtc gtg 
Gly Val Val 
895 

gac gat ggt 
Asp Asp Gly 
910 



780 

ggc tgg ggc 
Gly Trp Gly 
795 

tec cca tgg 
Ser Pro Trp 
810 

tgg acc cct 
Trp Thr Pro 
825 

ctg atg gag 
Leu Met Glu 
840 

cct gtc cag 
Pro Val Gin 
855 

egg egg gag 
Arg Arg Glu 
870 

tac aag ctg 
Tyr Lys Leu 
885 

egg agg ate 
Arg Arg He 
900 

gag gag ate 
Glu Glu He 
915 



46 



EP 1 391 516 A1 



ctg ttg gac acc cag tgc cag ggg ctg cct cgc ccg gaa ccc cag 
Leu Leu Asp Thr Gin Cys Gin Gly Leu Pro Arg Pro Glu Pro Gin 

920 925 930 

gag gcc tgc age ctg gag ccc tgc cca cct agg tgg aaa gtc atg 
Glu Ala Cys Ser Leu Glu Pro Cys Pro Pro Arg Trp Lys Val Met 

935 940 945 

tec ctt ggc cca tgt teg gcc age tgt ggc ctt ggc act get aga 
Ser Leu Gly Pro Cys Ser Ala Ser Cys Gly Leu Gly Thr Ala Arg 

950 955 960 

cgc teg gtg gcc tgt gtg cag etc gac caa ggc cag gac gtg gag 
Arg Ser Val Ala Cys Val Gin Leu Asp Gin Gly Gin Asp Val Glu 

965 970 975 

gtg gac gag gcg gcc tgt gcg gcg ctg gtg egg ccc gag gcc agt 
Val. Asp Glu Ala Ala Cys Ala Ala Leu Val Arg Pro Glu Ala Ser 

980 985 990 

gtc ccc tgt etc att gcc gac tgc acc tac cgc tgg cat gtt ggc 
Val Pro Cys Leu lie Ala Asp Cys Thr Tyr Arg Trp His Val Gly 

995 1000 1005 

acc tgg atg gag tgc tct gtt tec tgt ggg gat ggc ate cag cgc 
Thr Trp Met Glu Cys Ser Val Ser Cys Gly Asp Gly He Gin Arg 

1010 1015 1020 

egg cgt gac acc tgc etc gga ccc cag gcc cag gcg cct gtg cca 
Arg Arg Asp Thr Cys Leu Gly Pro Gin Ala Gin Ala Pro Val Pro 

1025 1030 1035 

get gat ttc tgc cag cac ttg ccc aag ccg gtg act gtg cgt ggc 
Ala Asp Phe Cys Gin His Leu Pro Lys Pro Val Thr Val Arg Gly 

1040 1045 1050 

tgc tgg get ggg ccc tgt gtg gga cag ggt gcc tgt ggc agg cag 
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Cys Trp Ala Gly Pro Cys Val Gly Gin Gly Ala Cys Gly Arg Gin 

1055 1060 1065 

cac ctt gag cca aca gga acc att gac atg cga ggc cca ggg cag 
His Leu Glu Pro Thr Gly Thr lie Asp Met Arg Gly Pro Gly Gin 

1070 1075 1080 

gca gac tgt gca gtg gcc att ggg egg ccc etc ggg gag gtg gtg 
Ala Asp Cys Ala Val Ala He Gly Arg Pro Leu Gly Glu Val Val 

1085 1090 1095 

acc etc cgc gtc ctt gag .agt tct etc aac tgc agt gcg ggg gac 
Thr Leu Arg Val Leu Glu Ser Ser Leu Asn Cys Ser Ala Gly Asp 

1100 1105 1110 

atg ttg ctg ctt tgg ggc egg etc acc tgg agg aag atg tgc agg 
Met Leu Leu Leu Trp Gly Arg Leu Thr Trp Arg Lys Met Cys Arg 
.1115 .1120 1125 

aag ctg ttg gac atg act ttc age tec aag acc aac acg ctg gtg 
Lys Leu Leu Asp Met Thr Phe Ser Ser Lys Thr Asn Thr Leu Val 

1130 1135 1140 

gtg agg cag cgc tgc ggg. egg cca gga ggt ggg gtg ctg ctg egg 
Val Arg Gin Arg Cys Gly Arg Pro Gly Gly Gly Val Leu Leu Arg 

1145 1150 1155 

tat ggg age cag ctt get cct gaa acc ttc tac aga gaa tgt gac 
Tyr Gly Ser Gin Leu Ala Pro Glu Thr Phe Tyr Arg Glu Cys Asp 

1160 1165 1170 

atg cag etc ttt ggg ccc tgg ggt gaa ate gtg age ccc teg ctg 
Met Gin Leu Phe Gly Pro Trp Gly Glu He Val Ser Pro Ser Leu 

1175 1180 1185 

agt cca gcc acg agt aat gca ggg ggc tgc egg etc ttc att aat 
Ser Pro Ala Thr Ser Asn Ala Gly Gly Cys Arg Leu Phe He Asn 



48 



EP 1 391 516 A1 



gtg get 
Val Ala 

atg ggc 
Met Gly 

egg gac 
Arg Asp 

gtg etc 
Val Leu 

age gag 
Ser Glu 

tgg ace 
Trp Thr 

tgg aag 
Trp Lys 



ccg cac 
Pro His 

get ggg 
Ala Gly 

ace cac 
Thr His 

tac tgg 
Tyr Trp 

ggc t tc 
Gly Phe 

etc caa 
Leu Gin 

gga aag 
Gly Lys 



H90 

gca egg att 
Ala Arg He 
1205 

ace gag gga 
Thr Glu Gly 
1220 

age ttg agg 
Ser Leu Arg 
1235 

gag tea gag 
Glu Ser Glu 
1250 

ctg aag get 
Leu Lys Ala 
1265 

tea tgg gta 
Ser Trp Val 
1280 

gaa gga ace 
Glu Gly Thr 
1295 



gec ate 
Ala He 

gec aat 
Ala Asn 

acc aca 
Thr Thr 

age age 
Ser Ser 

cag gee 
Gin Ala 

ccg gag 
Pro Glu 



1195 
cat gec 
His Ala 
1210 
gee age 
Ala Ser 
1225 
gcg ttc 
Ala Phe 
1240 
cag get 
Gin Ala 
1255 
age ctg 
Ser Leu 
1270 
atg cag 
Met Gin 
1285 



ctg gee 
Leu Ala 

tac ate 
Tyr He 

cat ggg 
His Gly 

gag atg 
Glu Met 

egg ggc 
Arg Gly 

gac cct 
Asp Pro 



1200 
acc aac 
Thr Asn 

1215 
ttg ate 
Leu He 

1230 
cag cag 
Gin Gin 

1245 
gag ttc 
Glu Phe 

1260 
cag tac 
Gin Tyr 

1275 
cag tec 
Gin Ser 

1290 



<210>18 
<21 1> 1378 
<212>PRT 

<213> Homo sapiens 
<400>18 
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get gca ggc ggc 
Ala Ala Gly Gly 
1 

ccc gat gtc ttc 
Pro Asp Val Phe 

etc acc aac etc 
Leu Thr Asn Leu 

ctg ggg get cag 
Leu Gly Ala Gin 

aca gag cct gag 
Thr Glu Pro Glu 

tec ctg ctg age 
Ser Leu Leu Ser 

gac gac acg gat 
Asp Asp Thr Asp 

agg ttt gac ctg 
Arg Phe Asp Leu 

gtc acc cag ctg 
Val Thr Gin Leu 

at t acc gag gac 



ate eta 
lie Leu 
5 

cag get 
Gin Ala 
20 

aac ate 
Asn He 
35 

ttt egg 
Phe Arg 
50 

ggt get 
Gly Ala 
65 

gtc tgt 
Val Cys 
80 

cct ggc 
Pro Gly 
95 

gag ttg 
Glu Leu 
110 

ggc ggt 
Gly Gly 
125 

act ggc 



cac ctg 
His Leu 

cac cag 
His Gin 

ggg gca 
Gly Ala 

gtg cac 
Val His 

cca aat 
Pro Asn 

ggg tgg 
Gly Trp 

cat get 
His Ala 

cct gat 
Pro Asp 

gec tgc 
Ala Cys 

ttc gac 



gag ctg 
Glu Leu 
10 

gag gac 
Glu Asp 
25 

gaa ctg 
Glu Leu 
40 

ctg gtg 
Leu Val 

f r 

oo 

ate aca 
He Thr 
70 

age cag 
Ser Gin 

85 

gac ctg 
Asp Leu 

100 
ggt aac 
Gly Asn 

115 
tec cca 
Ser Pro 

130 
ctg gga 



ctg gtg 
Leu Val 

aca gag 
Thr Glu 

ctt egg 
Leu Arg 

aag atg 
Lys Met 

gec aac 
Ala Asn 

acc ate 
Thr He 

gtc etc 
Val Leu 

egg cag 
Arg Gin 

acc tgg 
Thr Trp 

gtc acc 



gec gtg. ggc 
Ala Val Gly 
15 

cgc tat gtg 
Arg Tyr Val 
30 

gac ccg tec 
Asp Pro Ser 
45 

gtc att ctg 
Val He Leu 
60 

etc acc teg 
Leu Thr Ser 
75 

aac cct gag 
Asn Pro Glu 
90 

tat ate act 
Tyr He Thr 
105 

gtg egg ggc 
Val Arg Gly 
120 

age tgc etc 
Ser Cys Leu 
135 

att gec cat 
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He Thr Glu Asp Thr Gly Phe 
140 

gag att ggg cac age ttc ggc 
Glu lie Gly His Ser Phe Gly 
155 

age ggc tgc ggc ccc age gga 
Ser Gly Cys Gly Pro Ser Gly 
170 

gcg ccc cgc gec ggc etc gec 
Ala Pro Arg Ala Gly Leu Ala 
185 

ctg ctg age ctg etc agg acg 
Leu Leu Ser Leu Leu Arg Thr 
200 

gcg gec tea acc egg gtc cgc 
Ala Ala Ser Thr Arg Val Arg 
215 

tgg cct eta eta cag cgc caa 
Trp Pro Leu Leu Gin Arg Gin 
230 

ccc caa ggc tgt cgc ctg cac 
Pro Gin Gly Cys Arg Leu His 
245 

tct gec ggc ggt ggc ctg gga 
Ser Ala Gly Gly Gly Leu Gly 
260 

acc cag etc acg tec ccc caa 
Thr Gin Leu Thr Ser Pro Gin 
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Asp Leu Gly Val Thr He Ala His 

145 150 

ctg gag cac gac ggc gcg ccc ggc 

Leu Glu His Asp Gly Ala Pro Gly 

160 165 

cac gtg atg get teg gac ggc gec 

His Val Met Ala Ser Asp Gly Ala 

175 180 

tgg tec ccc tgc age cgc egg cag 

Trp Ser Pro Cys Ser Arg Arg Gin 

190 195 

ggc gcg ctg cgt gtg gga ccc gec 

Gly Ala Leu Arg Val Gly Pro Ala 

205 210 

ggg gca ccc gee gga tgc gca gee 

Gly Ala Pro Ala Gly Cys Ala Ala 

220 225 

cga gca gtg ccg cgt ggc ctt egg 

Arg Ala Val Pro Arg Gly Leu Arg 

235 240 

ctt cgc cag gga gca cct ggt gag 

Leu Arg Gin Gly Ala Pro Gly Glu 

250 255 

ttg get gtg agg tec etc cgc ate 

Leu Ala Val Arg Ser Leu Arg He 

265 270 

acg tgc atg gat atg tgc cag gee 

Thr Cys Met Asp Met Cys Gin Ala 
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etc tec 
Leu Ser 

etc etc 
Leu Leu 

tgg tgc 
Trp Cys 

ata gca 
He Ala 

cct tgc 
Pro Cys 

cag tgc 
Gin Cys 

ggt get 
Gly Ala 

aag acc 
Lys Thr 

ggc cag 
Gly Gin 



tgc cac 
Cys His 

gtt cct 
Val Pro 

tec aag 
Ser Lys 

gca gtg 
Ala Val 

tec cgc 
Ser Arg 

aac aac 
Asn Asn 

gac etc 
Asp Leu 

cag ctg 
Gin Leu 

ccg ctg 
Pro Leu 



275 

aca gac 
Thr Asp 
290 

etc ctg 
Leu Leu 
305 

ggt cgc 
Gly Arg 
320 

cat ggg 
His Gly 
335 

tec tgc 
Ser Cys 
350 

ccc aga 
Pro Arg 
365 

cag gee 
Gin Ala 
380 

gag ttc 
Glu Phe 
395 

cgc tec 
Arg Ser 
410 



ccg ctg gac 
Pro Leu Asp 

gat ggg aca 
Asp Gly Thr 

tgc cgc tec 
Cys Arg Ser 

cgc tgg tct 
Arg Trp Ser 

gga gga ggt 
Gly Gly Gly 

cct gee ttt 
Pro Ala Phe 

gag atg tgc 
Glu Met Cys 

atg teg caa 
Met Ser Gin 

tec cct ggc 
Ser Pro Gly 



280 

caa age 
Gin Ser 
295 

gaa tgt 
Glu Cys 
310 

ctg gtg 
Leu Val 
325 

age tgg 
Ser Trp 
340 

gtg gtc 
Val Val 
355 

ggg ggg 
Gly Gly 
370 

aac act 
Asn Thr 
385 

cag tgc 
Gin Cys 
400 

ggc gee 
Gly Ala 
415 



age tgc 
Ser Cys 

ggc gtg 
Gly Val 

gag ctg 
Glu Leu 

ggt ccc 
Gly Pro 

acc agg 
Thr Arg 

cgt gca 
Arg Ala 

cag gee 
Gin Ala 

gee agg 
Ala Arg 

tec ttc 
Ser Phe 



285 
age cgc 
Ser Arg 

300 
gag aag 
Glu Lys 

315 
acc ccc 
Thr Pro 

330 
cga agt 
Arg Ser 

345 
agg egg 
Arg Arg 

360 
tgt gtt 
Cys Val 

375 
tgc gag 
Cys Glu 

390 
acc gac 
Thr Asp 

405 
tac cac 
Tyr His 

420 
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tgg ggt 

Tip Gly 

cac atg 
His Met 

gac age 
Asp Ser 

gag gac 
Glu Asp 

ttt ggc 
Phe Gly 

tgc cag 
Cys Gin 

ggc tct 
Gly Ser 

aca gtt 
Thr Val 

cct etc 
Pro Leu 



get get 
Ala Ala 

tgc egg 
Cys Arg 

ttc etc 
Phe Leu 

ggg acc 
Gly Thr 

tgt gat 
Cys Asp 

gtg tgt 
Val Cys 

ttc aca 
Phe Thr 

acc ccc 
Thr Pro 

ttc aca 
Phe Thr 



gtg get ggg aag 



gta cca 
Val Pro 
425 

gec att 
Ala He 
440 

gat ggg 
Asp Gly 
455 

ctg age 
Leu Ser 
470 

ggt agg 
Gly Arg 
485 

ggt ggg 
Gly Gly 
500 

get ggc 
Ala Gly 
515 

aac ctg 
Asn Leu 
530 

cac ttg 
His Leu 
545 

atg age 



cac age 
His Ser 

ggc gag 
Gly Glu 

acc egg 
Thr Arg 

ctg tgt 
Leu Cys 

atg gac 
Met Asp 

gac aac 
Asp Asn 

aga gcg 
Arg Ala 

acc agt 
Thr Ser 

gcg gtg 
Ala Val 

ate tec 



caa ggg 
Gin Gly 

430 
age ttc 
Ser Phe 

445 
tgt atg 
Cys Met 

460 
gtg teg 
Val Ser 

475 
tec cag 
Ser Gin 

490 
age acg 
Ser Thr 

505 
aga gaa 
Arg Glu 

520 
gtc tac 
Val Tyr 

535 
agg ate 
Arg He 

550 
cct aac 



gat get 
Asp Ala 

ate atg 
He Met 

cca agt 
Pro Ser 

ggc age 
Gly Ser 

cag gta 
Gin Val 

tgc age 
Cys Ser 

tat gtc 
Tyr Val 

att gec 
He Ala 

gga ggg 
Gly Gly 

acc acc 



ctg tgc aga 
Leu Cys Arg 
435 

aag cgt gga 
Lys Arg Gly 
450 

ggc ccc egg 
Gly Pro Arg 
465 

tgc agg aca 
Cys Arg Thr 
480 

tgg gac agg 
Trp Asp Arg 
495 

cca egg aag 
Pro Arg Lys 
510 

acg ttt ctg 
Thr Phe Leu 
525 

aac cac agg 
Asn His Arg 
540 

cgc tat gtc 
Arg Tyr Val 
555 

tac ccc tec 
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U 1 A 1 

Val Ala 


Gly 


Lys Met 


fx 

Ser 


He 


Ser Pro Asn 


Thr 


Thr 


Tyr 


Pro 


Ser 




5 






560 






565 










570 






etc ctg 


gag 


gat ggt 


cgt 


gtc 


gag tac aga 


gtg 


gec 


etc 


acc 


gag 


1755 




Leu Leu 


Glu 


Asp Gly 


Arg 


Val 


Glu Tyr Arg 


Val 


Ala 


Leu 


Thr 


Glu 




10 






575 






580 










585 






gac egg 


ctg 


ccc cgc 


ctg 


gag 


gag ate cgc 


ate 


tgg 


gga 


ccc 


etc 


1800 


15 


Asp Arg 


Leu 


Pro Arg 
590 


Leu 


Glu 


Glu He Arg 
595 


He 


Trp 


Gly 


Pro 


Leu 
600 






cag gaa 


gat 


get gac 


ate 


cag 


gtt tac agg 


egg 


tat 


ggc 


gag 


gag 


1845 


20 


Gin Glu 


Asp 


Ala Asp 
605 


He 


Gin 


Val Tyr Arg 
610 


Arg 


Tyr 


Gly 


Glu 


Glu . 
615 




25 


tat ggc 


aac 


etc acc 


cgc 


cca 


gac ate acc 


tic 


acc 


tac 


ttc 


cag 


1890 


Tyr Gly 


Asn 


Leu Thr 
620 


Arg 


Pro 


Asp lie Thr 

. 625 


Phe 


Thr 


Tyr 


Phe 


Gin 
630 




30 


cct aag 


cca 


egg cag 


gec 


tgg 


gtg tgg gec 


get 


gtg 


cgt 


ggg 


ccc 


1935 




Pro Lys 


Pro 


Arg Gin 

635 


Ala 


Trp 


Val Trp Ala 
640 


Ala 


Val 


Arg 


Gly 


Pro 

645 




35 


tgc teg 


gtg 


age tgt 


ggg 


gca 


ggg ctg cgc 


tgg 


gta 


aac 


tac 


age 


1980 




Cys Ser 


Val 


Ser Cys 


Gly 


Ala 


Gly Leu Arg 


Trp 


Val 


Asn 


Tyr 


Ser 




40 






650 






655 










660 






tgc ctg 


gac 


cag gec 


agg 


aag 


gag ttg gtg 


gag 


act 


gtc 


cag 


tgc 


2025 




Cys Leu 


Asp 


Gin Ala 


Arg 


Lys 


Glu Leu Val 


Glu 


Thr 


Val 


Gin 


Cys 




45 






665 






670 










675 






caa ggg 


age 


cag cag 


cca 


cca 


gcg tgg cca 


gag 


gee 


tgc 


gtg 


etc 


2070 


50 


Gin Gly 


Ser 


Gin Gin 


Pro 


Pro 


Ala Trp Pro 


Glu 


Ala 


Cys 


Val 


Leu 








680 






685 










690 






gaa ccc 


tgc 


cct ccc 


tac 


tgg 


gcg gtg gga 


gac 


ttc 


ggc 


cca 


tgc 


2115 


55 


Glu Pro 


Cys 


Pro Pro 


Tyr 


Trp 


Ala Val Gly 


Asp 


Phe 


Gly 


Pro 


Cys 
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age gec 
Ser Ala 

gtg gag 
Val Glu 

tgc aga 
Cys Arg 

aac ccc 
Asii Pro 

tea tgc 
Ser Cys 

acc tgt 
Thr Cys 

ggg cct 
Gly Pro 

gtc ggg 
Val Gly 

tct gca 
Ser Ala 



695 

tec tgt ggg 
Ser Cys Gly 
710 

gec cag ggc 
Ala Gin Gly 
725 

gca ggg gec 
Ala Gly Ala 
740 

cag ccc tgc 
Gin Pro Cys 
755 

aca tea get 
Thr Ser Ala 
770 

gtg cca ggg 
Val Pro Gly 
785 

ggc tec gta 
Gly Ser Val 
800 

atg tea tgt 
Met Ser Cys 
815 

ggg gag aag 
Gly Glu Lys 
830 



ggc ggc 
Gly Gly 

age etc 
Ser Leu 

cag cag 
Gin Gin 

cct gec 
Pro Ala 

ggt gga 
Gly Gly 

gca gat 
Ala Asp 

gat gag 
Asp Glu 

cct cca 
Pro Pro 

get ccc 
Ala Pro 



700 

ctg egg gag 
Leu Arg Glu 
715 

ctg aag aca 
Leu Lys Thr 
730 

cca get gtg 
Pro Ala Val 
745 

agg tgg gag 
Arg Trp Glu 
760 

gca ggc ctg 
Ala Gly Leu 
775 

ggc ctg gag 
Gly Leu Glu 
790 

aag ctg cct 
Lys Leu Pro 
805 

ggc tgg ggc 
Gly Trp Gly 
820 

tec cca tgg 
Ser Pro Trp 
835 



egg cca 
Arg Pro 

ttg ccc 
Leu Pro 

gcg ctg 
Ala Leu 

gtg tea 
Val Ser 

gee ttg 
Ala Leu 

get cca 
Ala Pro 

gee cct 
Ala Pro 

cat ctg 
His Leu 

ggc age 
Gly Ser 



705 

gtg cgc tgc 
Val Arg Cys 
720 

cca gec egg 
Pro Ala Arg 
735 

gaa acc tgc 
Glu Thr Cys 
750 

gag ccc age 
Glu Pro Ser 
765 

gag aac gag 
Glu Asn Glu 
780 

gtg act gag 
Val Thr Glu 
795 

gag ccc tgt 
Glu Pro Cys 
810 

gat gee acc 
Asp Ala Thr 
825 

ate agg acg 
He Arg Thr 
840 
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ggg get caa get gca cac gtg tgg acc cct gcg gca ggg teg tgc 

Gly Ala Gin Ala Ala His Val Trp Thr Pro Ala Ala Gly Ser Cys 

845 850 855 

tec gtc tec tgc ggg cga ggt ctg atg gag ctg cgt ttc ctg tgc 

Ser Val Ser Cys Gly Arg Gly Leu Met Glu Leu Arg Phe Leu Cys 

860 865 870 

atg gac tct gee etc agg gtg cct gtc cag gaa gag ctg tgt ggc 

Met Asp Ser Ala Leu Arg Val Pro Val Gin Glu Glu Leu Cys Gly 

875 880 885 

ctg gca age aag cct ggg age egg egg gag gtc tgc cag get gtc 

Leu Ala Ser Lys Pro Gly Ser Arg Arg Glu Val Cys Gin Ala Val 

8S0 895 900 

ccg tgc cct get egg tgg cag tac aag ctg gcg gee tgc age gtg 

Pro Cys Pro Ala Arg Trp Gin Tyr.Lys Leu Ala Ala Cys Ser Val 

905 910 915 

age tgt ggg aga ggg gtc gtg egg agg ate ctg tat tgt gec egg 

Ser Cys Gly Arg Gly Val Val Arg Arg He Leu Tyr Cys Ala Arg 

920 925 930 

gec cat ggg gag gac gat ggt gag gag ate ctg ttg gac acc cag 

Ala His Gly Glu Asp Asp Gly Glu Glu He Leu Leu Asp Thr Gin 

935 940 945 

tgc cag ggg ctg cct cgc ccg gaa ccc cag gag gee tgc age ctg 

Cys Gin Gly Leu Pro Arg Pro Glu Pro Gin Glu Ala Cys Ser Leu 

950 955 960 

gag ccc tgc cca cct agg tgg aaa gtc atg tec ctt ggc cca tgt 

Glu Pro Cys Pro Pro Arg Trp Lys Val Met Ser Leu Gly Pro Cys 

965 970 975 

teg gec age tgt ggc ctt ggc act get aga cgc teg gtg gec tgt 
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Ser Ala Ser Cys Gly Leu Gly Thr Ala Arg Arg Ser Val Ala Cys 

980 985' 990 

gtg cag etc gac caa ggc cag gac gtg gag gtg gac gag gcg gec 
Val Gin Leu Asp Gin Gly Gin Asp Val Glu Val Asp Glu Ala Ala 

995 1000 1005 

tgt gcg gcg ctg gtg egg ccc gag gec agt gtc ccc tgt etc att 
Cys Ala Ala Leu Val Arg Pro Glu Ala Ser Val Pro Cys Leu He 

1010 1015 1020 

gec gac tgc acc tac cgc tgg cat gtt ggc acc tgg atg gag tgc 
Ala Asp Cys Thr Tyr Arg Trp His Val Gly Thr Trp Met Glu Cys 

1025 1030 1035 

ict gtt tec tgt ggg gat ggc ate cag cgc egg cgt gac acc tgc 
Ser Val Ser Cys Gly Asp Gly He Gin Arg Arg Arg Asp Thr Cys 

1040 . 1045 1050 

etc gga ccc cag gee cag gcg cct gtg cca get gat ttc tgc cag 
Leu Gly Pro Gin Ala Gin Ala Pro Val Pro Ala Asp Phe Cys Gin 

1055 1060 1065 

cac ttg ccc aag ccg gtg act gtg cgt ggc tgc tgg get ggg ccc 
His Leu Pro Lys Pro Val Thr Val Arg Gly Cys Trp Ala Gly Pro 

1070 1075 1080 

tgt gtg gga cag ggt acg ccc age ctg gtg ccc cac gaa gaa gee 
Cys Val Gly Gin Gly Thr Pro Ser Leu Val Pro His Glu Glu Ala 

1085 1090 1095 

get get cca gga egg acc aca gec acc cct get ggt gee tec ctg 
Ala Ala Pro Gly Arg Thr Thr Ala Thr Pro Ala Gly Ala Ser Leu 

1100 1105 1110 

gag tgg tec cag gec egg ggc ctg etc ttc tec ccg get ccc cag 
Glu Trp Ser Gin Ala Arg Gly Leu Leu Phe Ser Pro Ala Pro Gin 
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1 1 15 1120 1125 

cct egg egg etc ctg ccc ggg ccc cag gaa aac tea gtg cag tec 
Pro Arg Arg Leu Leu Pro Gly Pro Gin Glu Asn Ser Val Gin Ser 

1130 1135 1140 

agt gec tgt ggc agg cag cac ctt gag cca aca gga acc att gac 
Ser Ala Cys Gly Arg Gin His Leu Glu Pro Thr Gly Thr lie Asp 

1145 1150 1155 

atg cga ggc cca ggg cag gca gac tgt gca gtg gec att ggg egg 
Met Arg Gly Pro Gly Gin Ala Asp Cys Ala Val Ala He Gly Arg 

1160 1165 1170 

ccc etc ggg gag gtg gtg acc etc cgc gtc ctt gag agt tct etc 
Pro Leu Gly Glu Val Val Thr Leu Arg Val Leu Glu Ser Ser Leu 

1175 1180 1185 

aac tgc agt gcg ggg gac atg ttg ctg ctt tgg ggc. egg etc acc 
Asn Cys Ser Ala Gly Asp Met Leu Leu Leu Trp Gly Arg Leu Thr 

1190 1195 1200 

tgg agg aag atg tgc agg aag ctg ttg gac atg act ttc age tec 
Trp Arg Lys Met Cys Arg Lys Leu Leu Asp Met Thr Phe Ser Ser 

1205 1210 1215 

aag acc aac acg ctg gtg gtg agg cag cgc tgc ggg egg cca gga 
Lys Thr Asn Thr Leu Val Val Arg Gin Arg Cys Gly Arg Pro Gly 

1220 1225 1230 

ggt ggg gtg ctg ctg egg tat ggg age cag ctt get cct gaa acc 
Gly Gly Val Leu Leu Arg Tyr Gly Ser Gin Leu Ala Pro Glu Thr 

1235 1240 1245 

ttc tac aga gaa tgt gac atg cag etc ttt ggg ccc tgg ggt gaa 
Phe Tyr Arg Glu Cys Asp Met Gin Leu Phe Gly Pro Trp Gly Glu 

1250 1255 1260 
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ate gtg age ccc teg ctg agt cca gec acg agt aat gca ggg ggc 

He Val Ser Pro Ser Leu Ser Pro Ala Thr Ser Asn Ala Gly Gly 

1265 1270 1275 

tgc egg etc ttc att aat gtg get ccg cac gca egg att gee ate 

Cys Arg Leu Phe lie Asn Val Ala Pro His Ala Arg He Ala lie 

1280 1285 1290 

cat gec ctg gec acc aac atg ggc get ggg acc gag gga gec aat 

His Ala Leu Ala Thr Asn Met Gly Ala Gly Thr Glu Gly Ala Asn 

1295 1300 1305 

gec age tac ate ttg ate egg gac acc cac age ttg agg acc aca 

Ala Ser Tyr He Leu He Arg Asp Thr His Ser Leu Arg Thr Thr 

1310 1315 1320 

gcg ttc cat ggg cag cag gtg etc tac tgg gag tea gag age age 

Ala Phe His Gly Gin Gin Val Leu Tyr Trp Glu. Ser Glu Ser Ser 

1325 1330 1335 

cag get gag atg gag ttc age gag ggc ttc ctg aag get cag gec 

Gin Ala Glu Met Glu Phe Ser Glu Gly Phe Leu Lys Ala Gin Ala 

1340 1345 1350 

age ctg egg ggc cag tac tgg acc etc caa tea tgg gta ccg gag 

Ser Leu Arg Gly Gin Tyr Trp Thr Leu Gin Ser Trp Val Pro Glu 

1355 1360 1365 
atg cag gac cct cag tec tgg aag gga aag gaa gga acc 
Met Gin Asp Pro Gin Ser Trp Lys Gly Lys Glu Gly Thr 

1370 1375 



<210>19 

<211>1322 

<212>PRT 
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<2l3> Homo sapiens 
<400>l9 

get gca ggc ggc ate 
Ala Ala Gly Gly lie 
1 5 
ccc gat gtc ttc cag 
Pro Asp Val Phe Gin 
20 

etc ace aac etc aac 
Leu Thr Asn Leu Asn 
35 

etg ggg get cag ttt 
Leu Gly Ala Gin Phe 
50 

aca gag cct gag ggt 
Thr Glu Pro Glu Gly 
65 

tec ctg ctg age gtc 
Ser Leu Leu Ser Val 
80 

gac gac acg gat cct 
Asp Asp Thr Asp Pro 
95 

agg ttt gac ctg gag 
Arg Phe Asp Leu Glu 
110 

gtc ace cag ctg ggc 
Val Thr Gin Leu Gly 



eta cac ctg gag ctg 
Leu His Leu Glu Leu 
10 

get cac cag gag gac 
Ala His Gin Glu Asp 
25 

ate ggg gca gaa ctg 
He Gly Ala Glu Leu 
40 

egg gtg cae etg gtg 
Arg Val His Leu Val 
55 

get cca aat ate aca 
Ala Pro Asn He Thr 
70 

tgt ggg tgg age cag 
Cys Gly Trp Ser Gin 
85 

ggc cat get gac ctg 
Gly His Ala Asp Leu 
100 

ttg cct gat ggt aac 
Leu Pro Asp Gly Asn 
115 

ggt gee tgc tec cca 
Gly Ala Cys Ser Pro 



ctg gtg gec gtg ggc 
Leu Val Ala Val Gly 
15 

aca gag cgc tat gtg 
Thr Glu Arg Tyr Val 
30 

ctt egg gac ccg tec 
Leu Arg Asp Pro Ser 
45 

aag atg gtc at t ctg 
Lys Met Val He Leu 
60 

gee aac etc ace teg 
Ala Asn Leu Thr Ser 
75 

acc ate aac cct gag 
Thr He Asn Pro Glu 
90 

gtc etc tat ate act 
Val Leu Tyr He Thr 
105 

egg cag gtg egg ggc 
Arg Gin Val Arg Gly 
120 

acc tgg age tgc etc 
Thr Trp Ser Cys Leu 
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att acc 
He Thr 

gag att 
Glu He 

age ggc 
Ser Gly 

gcg ccc 

M. 
ni a 



ctg ctg 
Leu Leu 

gcg gec 
Ala Ala 

tgg cct 
Trp Pro 

ccc caa 
Pro Gin 

tct gec 
Ser Ala 



125 

gag gac act 
Glu Asp Thr 
140 

ggg cac age 
Gly His Ser 
155 

tgc ggc ccc 
Cys Gly Pro 
170 

cgc gec ggc 
Arg Ala Gly 
185 

age ctg etc 
Ser Leu Leu 
200 

tea acc egg 
Ser Thr Arg 
215 

eta eta cag 
Leu Leu Gin 
230 

ggc tgt cgc 
Gly Cys Arg 
245 

ggc ggt ggc 
Gly Gly Gly 
260 



ggc ttc gac 
Gly Phe Asp 

ttc ggc ctg 
Phe Gly Leu 

age gga cac 
Ser Gly His 

etc gec tgg 
Leu Ala Trp 

agg acg ggc 
Arg Thr Gly 

gtc cgc ggg 
Val Arg Gly 

cgc caa cga 
Arg Gin Arg 

ctg cac ctt 
Leu His Leu 

ctg gga ttg 
Leu Gly Leu 



130 

ctg gga gtc 
Leu Gly Val 
145 

gag cac gac 
Glu His Asp 
160 

gtg atg get 
Val Met Ala 
175 

tec ccc tgc 
Ser Pro Cys 
190 

gcg ctg cgt 
Ala Leu Arg 

205 
gca ccc gee 
Ala Pro Ala 

220 
gca gtg ccg 
Ala Val Pro 

235 
cgc cag gga 
Arg Gin Gly 

250 
get gtg agg 
Ala Val Arg 

265 



acc att 
Thr He 

ggc gcg 
Gly Ala 

teg gac 
Ser Asp 

age cgc 
Ser Arg 

gtg gga 
Val Gly 

gga tgc 
Gly Cys 

cgt ggc 
Arg Gly 

gca cct 
Ala Pro 

tec etc 
Ser Leu 



135 
gec cat 
Ala His 

150 
ccc ggc 
Pro Gly 

165 
ggc gec 
Gly Ala 

180 
egg cag 
Arg Gin 

195 
ecc gec 
Pro Ala 

210 
gca gec 
Ala Ala 

225 
ctt egg 
Leu Arg 

240 
ggt gag 
Gly Glu 

255 
cgc ate 
Arg He 

270 
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acc cag 
Thr Gin 

etc tec 
Leu Ser 

etc etc 
Leu Leu 

tgg tgc 
Trp Cys 

ata gca 
He Ala 

cct tgc 
Pro Cys 

cag tgc 
Gin Cys 

ggt get 
Gly Ala 

aag acc 
Lys Thr 



etc acg 
Leu Thr 

tgc cac 
Cys His 

gtt cct 
Val Pro 

tec aag 
Ser Lys 

gca gtg 
Ala Val 

tec cgc 
Ser Arg 

aac aac 
Asn Asn 

gac etc 
Asp Leu 

cag ctg 
Gin Leu 



ggc cag ccg ctg 



tec ccc 
Ser Pro 
275 

aca gac 
Thr Asp 
290 

etc ctg 
Leu Leu 
305 

ggt cgc 
Gly Arg 
320 

cat ggg 
His Gly 
335 

tec tgc 
Ser Cys 
350 

ccc aga 
Pro Arg 
365 

cag gee 
Gin Ala 
380 

gag ttc 
Glu Phe 
395 

cgc tec 



caa acg 
Gin Thr 

ccg ctg 
Pro Leu 

gat ggg 
Asp Gly 

tgc cgc 
Cys Arg 

cgc tgg 
Arg Trp 

gga gga 
Gly Gly 

cct gee 
Pro Ala 

gag atg 
Glu Met 

atg teg 
Met Ser 

tec cct 



tgc atg gat 
Cys Met Asp 

280 
gac caa age 
Asp Gin Ser 

295 
aca gaa tgt 
Thr Glu Cys 

310 
tec ctg gtg 
Ser Leu Val 

325 
tct age tgg 
Ser Ser Trp 

340 
ggt gtg gtc 
Gly Val Val 

355 
ttt ggg ggg 
Phe Gly Gly 

370 
tgc aac act 
Cys Asn Thr 

385 
caa cag tgc 
Gin Gin Cys 

400 
ggc ggc gee 



atg tgc 
Met Cys 

age tgc 
Ser Cys 

ggc gtg 
Gly Val 

gag ctg 
Glu Leu 

ggt ccc 
Gly Pro 

acc agg 
Thr Arg 

cgt gca 
Arg Ala 

cag gee 
Gin Ala 

gec agg 
Ala Arg 

tec ttc 



cag gec 
Gin Ala 

285 
age cgc 
Ser Arg 

300 
gag aag 
Glu Lys 

315 
acc ccc 
Thr Pro 

330 
cga agt 
Arg Ser. 

345 
agg egg 
Arg Arg 

360 
tgt gtt 
Cys Val 

375 
tgc gag 
Cys Glu 

390 
acc gac 
Thr Asp 

405 
tac cac 
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Gly Gin Pro Leu Arg Ser Ser Pro Gly Gly Ala Ser Phe Tyr His 

410 415 420 

tgg ggt get get gta cca cac age caa ggg gat get ctg tgc aga 
Trp Gly Ala Ala Val Pro His Ser Gin Gly Asp Ala Leu Cys Arg 

425 430 435 

cac atg tgc egg gee att ggc gag age ttc ate atg aag cgt gga 
His Met Cys Arg Ala lie Gly Glu Ser Phe He Met Lys Arg Gly 

440 445 450 

gac age ttc etc gat ggg ace egg tgt atg cca agt ggc ccc egg 
Asp Ser Phe Leu Asp Gly Thr Arg Cys Met Pro Ser Gly Pro Arg 

455 460 465 

gag gac ggg ace ctg age ctg tgt gtg teg ggc age tgc agg aca 
Glu Asp Gly Thr Leu Ser Leu Cys Val Ser Gly Ser Cys Arg Thr 

470 475 480 

ttt ggc tgt gat ggt agg atg gac tec cag cag gta tgg gac agg 
Phe Gly Cys Asp Gly Arg Met Asp Ser Gin Gin Val Trp Asp Arg 

485 490 495 

tgc cag gtg tgt ggt ggg gac aac age acg tgc age cca egg aag 
Cys Gin Val Cys Gly Gly Asp Asn Ser Thr Cys Ser Pro Arg Lys 

500 505 510 

ggc tct ttc aca get ggc aga gcg aga gaa tat gtc acg ttt ctg 
Gly Ser Phe Thr Ala Gly Arg Ala Arg Glu Tyr Val Thr Phe Leu 

515 520 525 

aca gtt ace ccc aac ctg ace agt gtc tac att gec aac cac agg 
Thr Val Thr Pro Asn Leu Thr Ser Val Tyr He Ala Asn His Arg 

530 535 540 

cct etc ttc aca cac ttg gcg gtg agg ate gga ggg cgc tat gtc 
Pro Leu Phe Thr His Leu Ala Val Arg lie Gly Gly Arg Tyr Val 
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545 550 555 

gtg get ggg aag atg age ate tec cct aac acc acc tac ccc tec 

Val Ala Gly Lys Met Ser He Ser Pro Asn Thr Thr Tyr Pro Ser 

• 560 565 570 

etc ctg gag gat ggt cgt gtc gag tac aga gtg gee etc acc gag 

Leu Leu Glu Asp Gly Arg Val Glu Tyr Arg Val Ala Leu Thr Glu 

575 580 585 

gac egg ctg ccc cgc ctg gag gag ate cgc ate tgg gga ccc etc 

Asp Arg Leu Pro Arg Leu Glu Glu He Arg He Trp Gly Pro Leu 

590 595 600 

cag gaa gat get gac ate cag gtt tac agg egg tat ggc gag gag 

Gin Glu Asp Ala Asp He Gin Val Tyr Arg Arg Tyr Gly Glu Glu 

605 610 615 

tat. ggc aac etc acc cgc cca gac ate acc ttc acc tac ttc cag 

Tyr Gly Asn Leu Thr Arg Pro Asp lie Thr Phe Thr Tyr Phe Gin 

620 625 630 

cct aag cca egg cag gec tgg gtg tgg gec get gtg cgt ggg ccc 

Pro Lys Pro Arg Gin Ala Trp Val Trp Ala Ala Val Arg Gly Pro 

635 640 645 

tgc teg gtg age tgt ggg gca ggg ctg cgc tgg gta aac tac age 

Cys Ser Val Ser Cys Gly Ala Gly Leu Arg Trp Val Asn Tyr Ser 

650 655 660 

tgc ctg gac cag gec agg aag gag ttg gtg gag act gtc cag tgc 

Cys Leu Asp Gin Ala Arg Lys Glu Leu Val Glu Thr Val Gin Cys 

665 670 675 
caa ggg age cag cag cca cca gcg tgg cca gag gee tgc gtg etc 

Gin Gly Ser Gin Gin Pro Pro Ala Trp Pro Glu Ala Cys Val Leu 

680 685 690 
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gaa ccc tgc cct ccc tac tgg gcg gtg gga gac ttc ggc cca tgc 

Glu Pro Cys Pro Pro Tyr Trp Ala Val Gly Asp Phe Gly Pro Cys 

695 700 705 

age gec tec tgt ggg ggc ggc ctg egg gag egg cca gtg cgc tgc 

Ser Ala Ser Cys Gly Gly Gly Leu Arg Glu Arg Pro Val Arg Cys 

710 715 720 

gtg gag gee cag ggc age etc ctg aag aca ttg ccc cca gec egg 

Val Glu Ala Gin Gly Ser Leu Leu Lys Thr Leu Pro Pro Ala Arg 

725 730 735 

tgc aga gca ggg gee cag cag cca get gtg gcg ctg gaa acc tgc 

Cys Arg Ala Gly Ala Gin Gin Pro Ala Val Ala Leu Glu Thr Cys 

740 745 750 

aac ccc cag ccc tgc cct gee agg tgg gag gtg tea gag ccc age 

Asn Pro Gin Pro Cys Pro Ala Arg Trp Glu Val. Ser Glu Pro Ser 

755 760 765 

tea tgc aca tea get ggt gga gca ggc ctg gec ttg gag aac gag 

Ser Cys Thr Ser Ala Gly Gly Ala Gly Leu Ala Leu Glu Asn Glu 

770 775 780 

acc tgt gtg cca ggg gca gat ggc ctg gag get cca gtg act gag 

Thr Cys Val Pro Gly Ala Asp Gly Leu Glu Ala Pro Val Thr Glu 

785 790 795 

ggg cct ggc tec gta gat gag aag ctg cct gec cct gag ccc tgt 

Gly Pro Gly Ser Val Asp Glu Lys Leu Pro Ala Pro Glu Pro Cys 

800 . 805 810 

gtc ggg atg tea tgt cct cca ggc tgg ggc cat ctg gat gec acc 

Val Gly Met Ser Cys Pro Pro Gly Trp Gly His Leu Asp Ala Thr 

815 820 825 

tct gca ggg gag aag get ccc tec cca tgg ggc age ate agg acg 
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Ser Ala Gly Glu Lys Ala Pro Ser Pro Trp Gly Ser He Arg Thr 
5 830 835 840 

ggg get caa get gca cac gtg tgg ace cct gcg gca ggg teg tgc 2565 
Gly Ala Gin Ala Ala His Val Trp Thr Pro Ala Ala Gly Ser Cys 
845 850 855 

tec gtc tec tgc ggg cga ggt ctg atg gag ctg cgt ttc ctg tgc 2610 
15 Ser Val Ser Cys Gly Arg Gly Leu Met Glu Leu Arg Phe Leu Cys 

860 865 870 

atg gac tct gec etc agg gtg cct gtc cag gaa gag ctg tgt ggc 2655 
20 Met Asp Ser Ala Leu Arg Val Pro Val Gin Glu Glu Leu Cys Gly 

875 880 885 

ctg gca age aag cct ggg age egg egg gag gtc tgc cag get gtc 2700 

25 

Leu Ala Ser Lys Pro Gly Ser Arg Arg Glu Val Cys Gin Ala Val 
890 .895 . 900 

30 ccg tgc cct get egg tgg cag tac aag ctg gcg gec tgc age gtg 2745 

Pro Cys Pro Ala Arg Trp Gin Tyr Lys Leu Ala Ala Cys Ser Val 
905 910 915 

35 

age tgt ggg aga ggg gtc gtg egg agg ate ctg tat tgt gee egg 2790 
Ser Cys Gly Arg Gly Val Val Arg Arg He Leu Tyr Cys Ala Arg 

920 925 930 

gee cat ggg gag gac gat ggt gag gag ate ctg ttg gac acc cag 2835 
Ala His Gly Glu Asp Asp Gly Glu Glu He Leu Leu Asp Thr Gin 

935 940 945 

tgc cag ggg ctg cct cgc ccg gaa ccc cag gag gee tgc age ctg 2880 
Cys Gin Gly Leu Pro Arg Pro Glu Pro Gin Glu Ala Cys Ser Leu 

50 

950 955 960 

gag ccc tgc cca cct agg tgg aaa gtc atg tec ctt ggc cca tgt 2925 
55 Glu Pro Cys Pro Pro Arg Trp Lys Val Met Ser Leu Gly Pro Cys 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



teg gec age tgt 
Ser Ala Ser Cys 

gtg cag etc gac 
Val Gin Leu Asp 

tgt gcg gcg ctg 
Cys Ala Ala Leu 

gee gac tgc acc 
Ala Asp Cys Thr 

tct gtt tec tgt 
Ser Val Ser Cys 

etc gga ccc cag 
Leu Gly Pro Gin 

cac ttg ccc aag 
His Leu Pro Lys 

tgt gtg gga cag 
Cys Yal Gly Gin 

gga acc att gac 
Gly Thr He Asp 



55 



965 

ggc ctt ggc 
Gly Leu Gly 
980 

caa ggc cag 
Gin Gly Gin 
995 

gtg egg ccc 
Val Arg Pro 
1010 

tac cgc tgg 
Tyr Arg Trp 
1025 

ggg gat ggc 
Gly Asp Gly 
1040 

gec cag gcg 
Ala Gin Ala 
1055 

ccg gtg act 
Pro Val Thr 
1070 

ggt gec tgt 
Gly Ala Cys 
1085 

atg cga ggc 
Met Arg Gly 
1100 



act get 
Thr Ala 

gac gtg 
Asp Val 

gag gec 
Glu Ala 

cat gtt 
His Val 

ate cag 
He Gin 

cct gtg 
Pro Val 

gtg cgt 
Val Arg 

ggc agg 
Gly Arg 

cca ggg 
Pro Gly 



970 

aga cgc 
Arg Arg 
985 

gag gtg 
Glu Val 
1000 
agt gtc 
Ser Val 
1015 
ggc acc 
Gly Thr 
1030 
cgc egg 
Arg Arg 
1045 
cca get 
Pro Ala 
1060 
ggc tgc 
Gly Cys 
1075 
cag cac 
Gin His 
1090 
cag gca 
Gin Ala 
1105 



teg gtg 
Ser Val 

gac gag 
Asp Glu 

ccc tgt 
Pro Cys 

tgg atg 
Trp Met 

cgt gac 
Arg Asp 

gat ttc 
Asp Phe 

tgg get 
Trp Ala 

ctt gag 
Leu Glu 

gac tgt 
Asp Cys 



975 
gec tgt 
Ala Cys 

990 
gcg gee 
Ala Ala 

1005 
etc att 
Leu lie 

1020 
gag tgc 
Glu Cys 

1035 
acc tgc 
Thr Cys 

1050 
tgc cag 
Cys Gin 

1065 
ggg ccc 
Gly Pro 

1080 
cca aca 
Pro Thr 

1095 
gca gtg 
Ala Val 

1110 



2970 



3015 



3060 



3105 



3150 



3195 



3240 



3285 



3330 
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gcc att ggg egg ccc etc ggg gag gtg gtg acc etc cgc gtc ctt 3375 
5 Ala He Gly Arg Pro Leu Gly Glu Val Val Thr Leu Arg Val Leu 

1115 1120 1125 

gag agt tct etc aac tgc agt gcg ggg gac atg ttg ctg ctt tgg 3420 
10 Glu Ser Ser Leu Asn Cys Ser Ala Gly Asp Met Leu Leu Leu Trp 

H30 1135 1140 

ggc egg etc acc tgg agg aag atg tgc agg aag ctg ttg gac atg 3465 
Gly Arg Leu Thr Trp Arg Lys Met Cys Arg Lys Leu Leu Asp Met 
1145 1150 1155 

20 act ttc age tec aag acc aac acg ctg gtg gtg agg cag cgc tgc 3510 

Thr Phe Ser Ser Lys Thr Asn Thr Leu Val Val Arg Gin Arg Cys 
1160 1165 1170 

25 

ggg egg cca gga ggt ggg gtg ctg ctg egg tat ggg age cag ctt 3555 
Gly Arg Pro Gly Gly Gly Val Leu Leu Arg Tyr Gly Ser .Gin Leu 
so 1175 1180 1185 

get cct gaa acc ttc tac aga gaa tgt gac atg cag etc ttt ggg 3600 
Ala Pro Glu Thr Phe Tyr Arg Glu Cys Asp Met Gin Leu Phe Gly 
1190 1195 1200 

ccc tgg ggt gaa ate gtg age ccc teg ctg agt cca gcc acg agt 3645 
Pro Trp Gly Glu He Val Ser Pro Ser Leu Ser Pro Ala Thr Ser 

40 

1205 1210 1215 

aat gca ggg ggc tgc egg etc ttc att aat gtg get ccg cac gca 3690 
45 Asn Ala Gly Gly Cys Arg Leu Phe He Asn Val Ala Pro His Ala 

1220 1225 1230 

egg att gcc ate cat gcc ctg gcc acc aac atg ggc get ggg acc 3735 

50 

Arg He Ala lie His Ala Leu Ala Thr Asn Met Gly Ala Gly Thr 
1235 1240 1245 

55 gag gga gcc aat gcc age tac ate ttg ate egg gac acc cac age 3780 
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Glu Gly Ala Asn Ala Ser Tyr He Leu He Arg Asp Thr His Ser 

1250 1255 1260 

ttg agg acc aca gcg ttc cat ggg cag cag gtg etc tac tgg gag 

Leu Arg Thr Thr Ala Phe His Gly Gin Gin Val Leu Tyr Trp Glu 

1265 . 1270 1275 

tea gag age age cag get gag atg gag ttc age gag ggc ttc ctg 

Ser Glu Ser Ser Gin Ala Glu Met Glu Phe Ser Glu Gly Phe Leu 

1280 1285 1290 

aag get cag gee age ctg egg ggc cag tac tgg acc etc caa tea 

Lys Ala Gin Ala Ser Leu Arg Gly Gin Tyr Trp Thr Leu Gin Ser 

1295 1300 1305 

t.gg gta ceg gag atg cag gac cct cag tec tgg aag gga aag gaa 

Trp Val Pro Glu Met Gin Asp Pro Gin Ser Trp Lys Gly Lys Glu 

1310 . 1315 . 1320 

gga acc 
Gly Thr 

<210>20 

<211>312 

<212>PRT 

<213> Homo sapiens 
<400>20 

get gca ggc ggc ate eta cac ctg gag ctg ctg gtg gee gtg ggc 

Ala Ala Gly Gly He Leu His Leu Glu Leu Leu Val Ala Val Gly 
15 10 15 

ccc gat gtc ttc cag get cac cag gag gac aca gag cgc tat gtg 

Pro Asp Val Phe Gin Ala His Gin Glu Asp Thr Glu Arg Tyr Val 
20 25 30 
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etc acc aac etc aac ate ggg gca gaa ctg ctt egg gac ccg tec 

Leu Thr Asn Leu Asn He Gly Ala Glu Leu Leu Arg Asp Pro Ser 

35 40 45 

ctg ggg get cag ttt egg gtg cac ctg gtg aag atg gtc att ctg 

Leu Gly Ala Gin Phe Arg Val His Leu Val Lys Met Val He Leu 

50 55 60 

aca gag cct gag ggt get cca aat ate aca gee aac etc acc teg 

Thr Glu Pro Glu Gly Ala Pro Asn He Thr Ala Asn Leu Thr Ser 

65 70 75 

tec ctg ctg age gtc tgt ggg tgg age cag acc ate aac cct gag 

Ser Leu Leu Ser Val Cys Gly Trp Ser Gin Thr He Asn Pro Glu 

80 85 90 

gac gac acg gat cct ggc cat get gac ctg gtc etc tat ate act 

Asp Asp Thr Asp Pro Gly His Ala Asp Leu .Val. Leu Tyr He Thr 

95 100 105 

agg ttt gac ctg gag ttg cct gat ggt aac egg cag gtg egg ggc 

Arg Phe Asp Leu Glu Leu Pro Asp Gly Asn Arg Gin Val Arg Gly 

110 115 120 

gtc acc cag ctg ggc ggt gee tgc tec cca acc tgg age tgc etc 

Val Thr Gin Leu Gly Gly Ala Cys Ser Pro Thr Trp Ser Cys Leu 

125 130 135 

att acc gag gac act ggc ttc gac ctg gga gtc acc att gee cat 

He Thr Glu Asp Thr Gly Phe Asp Leu Gly Val Thr He Ala His 

140 145 150 

gag att ggg cac age ttc ggc ctg gag cac gac ggc gcg ccc ggc 

Glu He Gly His Ser Phe Gly Leu Glu His Asp Gly Ala Pro Gly 

155 160 165 

age ggc tgc ggc ccc age gga cac gtg atg get teg gac ggc gee 
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n „ _ 

Ser 


Gly 


n 

Cys 


Gly 


Pro 


Ser 


Gly 


His Val 


Met 


Ala 


Ser 


Asp 


lily 


Ala 










170 








175 










1 oU 


gcg 


ccc 


cgc 


gec 


ggc 


etc 


gee 


tgg tec 


ccc 


tgc 


age 


cgc 


egg 


cag 


Ala 


Pro 


Arg 


A 1 

Ala 


Gly 


Leu 


Ala 


Trp Ser 


Pro 


Cys 


Ser 


Arg 


Arg 


Gin 










1 oc 

185 








190 










195 


ctg 


ctg 


age 


ctg 


etc 


agg 


acg 


ggc gcg 


ctg 


cgt 


gtg 


gga 


ccc 


gec 


Leu 


Leu 


Ser 


Leu 


Leu 


Arg 


Thr 


Gly Ala 


Leu 


Arg 


Val 


Gly 


Pro 


Ala 










200 








205 










210 


gcg 


gec 


tea 


acc 


egg 


gtc 


cgc 


ggg gca 


ccc 


gee 


gga 


tgc 


gca 


gec 


Ala 


Ala 


Ser 


Thr 


Arg 


Val 


Arg 


Gly Ala 


Pro 


Ala 


Gly 


Cys 


Ala 


Ala 










215 








220 










225 


tgg 


n r> t 


V/ l a 


eta 


cag 


cgc 


caa 


cga gca 


gig 


ccg 


cgt 


ggc 


ctt 


egg 


Trp 


Pro 


Leu 


Leu 


Gin 


Arg 


Gin 


Arg Ala 


Val 


Pro 


Arg 


Gly 


Leu 


Arg 




- 






230 








235 






- 




240 


ccc 


caa 


ggc 


tgt 


cgc 


ctg 


cac 


ctt cgc 


cag 


gga 


gca 


cct 


gga 


tat 


Pro 


Gin 


Gly 


Cys 


Arg 


Leu 


His 


Leu Arg 


Gin 


Gly 


Ala 


Pro 


Gly 


Tyr 










245 








250 










255 


gtg 


cca 


ggc 


cct 


etc 


ctg 


cca 


cac aga 


ccc 


got 


gga 


cca 


aag 


cag 


Val 


Pro 


Gly 


Pro 


Leu 


Leu 


Pro 


His Arg 


Pro 


Ala 


Gly 


Pro 


Lys 


Gin 










260 








265 










270 


ctg 


cag 


ccg 


cct 


cct 


cgt 


tec 


tct cct 


gga 


tgg 


gac 


aga 


atg 


tgg 


Leu 


Gin 


Pro 


Pro 


Pro 


Arg 


Ser 


Ser Pro 


Gly 


Trp 


Asp 


Arg 


Met 


Trp 










275 








280 










285 


cgt 


gga 


gaa 


gtg 


gtg 


etc 


caa 


ggg teg 


ctg 


ccg 


etc 


cct 


ggt 


gga 


Arg 


Gly 


Glu 


Val 


Val 


Leu 


Gin 


Gly Ser 


Leu 


Pro 


Leu 


Pro 


Gly 


Gly 










290 








295 










300 


get 


gac 


ccc 


cat 


age 


age 


agt 


gca tgg 


gcg 


ctg 


gtc 








Ala 


Asp 


Pro 


His 


Ser 


Ser 


Ser 


Ala Trp 


Ala 


Leu 


Val 
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305 310 

<210>21 

<211>270 

<212>PRT 

<213> Homo sapiens 
<400>21 

get gca ggc ggc ate eta cac ctg gag ctg ctg gtg gee gtg ggc 

Ala Ala Gly Gly He Leu His Leu Glu Leu Leu Val Ala Val Gly 

1 5 10 15 

ccc gat gtc ttc cag get cac cag gag gac aca gag cgc tat gtg 

Pro Asp Val Phe Gin Ala His Gin Glu Asp Thr Glu Arg Tyr Val 

20 25 30 

etc acc aac etc aac ate ggg gca gaa ctg ctt egg gac ccg tec 

Leu Thr Asn Leu Asn. He Gly Ala Glu Leu Leu Arg Asp Pro Ser 

35 40 45 

ctg ggg get cag ttt egg gtg cac ctg gtg aag atg gtc att ctg 

Leu Gly Ala Gin Phe Arg Val His Leu Val Lys Met Val He Leu 

50 55 60 

aca gag cct gag ggt get cca aat ate aca gec aac etc ace teg 

Thr Glu Pro Glu Gly Ala Pro Asn He Thr Ala Asn Leu Thr Ser 

65 70 75 

tec ctg ctg age gtc tgt ggg tgg age cag acc ate aac cct gag 

Ser Leu Leu Ser Val Cys Gly Trp Ser Gin Thr He Asn Pro Glu 

80 85 90 

gac gac acg gat cct ggc cat get gac ctg gtc etc tat ate act 

Asp Asp Thr Asp Pro Gly His Ala Asp Leu Val Leu Tyr He Thr 

95 100 105 
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agg ttt gac ctg gag ttg cct gat ggt aac egg cag gtg egg ggc 
Arg Phe Asp Leu Glu Leu Pro Asp Gly Asn Arg Gin Val Arg Gly 

110 115 120 

gtc ace cag ctg ggc ggt gee tgc tec cca ace tgg age tgc etc 
Val Thr Gin Leu Gly Gly Ala Cys Ser Pro Thr Trp Ser Cys Leu 

125 130 135 

att acc gag gac act ggc ttc gac ctg gga gtc ace att gee cat 
He Thr Glu Asp Thr Gly Phe Asp Leu Gly Val Thr lie Ala His 

140 145 150 

gag att ggg cac age ttc ggc ctg gag cac gac ggc gcg ccc ggc 
Glu He Gly His Ser Phe Gly Leu Glu His Asp Gly Ala Pro Gly 

155 160 165 

age ggc tgc ggc ccc age gga cac gtg atg get teg gac ggc gec 
Ser Gly Cys Gly Pro Ser Gly His Val Met Ala Ser Asp Gly Ala 

170 175 180 

gcg ccc cgc gee ggc etc gee tgg tec ccc tgc age cgc egg cag 
Ala Pro Arg Ala Gly Leu Ala Trp Ser Pro Cys Ser Arg Arg Gin 

185 190 195 

ctg ctg age ctg etc aga ccc gtc cct ccg teg ccg etc cct ctg 
Leu Leu Ser Leu Leu Arg Pro Val Pro Pro Ser Pro Leu Pro Leu 

200 205 210 

ctg gee acc cac etc tgc gee ggc agg age ctt agt ctt ggt ccc 
Leu Ala Thr His Leu Cys Ala Gly Arg Ser Leu Ser Leu Gly Pro 

215 220 225 

age caa gag ccg get cct ggt ggg ggg cgc ggg ccg aga act cct 
Ser Gin Glu Pro Ala Pro Gly Gly Gly Arg Gly Pro Arg Thr Pro 

230 235 240 

gtt ccc act cac aaa agg cca cgc ttc caa acg ctt cca tec teg 
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Val Pro Thr His Lys Arg Pro Arg Phe Gin Thr Leu Pro Ser Ser 
245 250 255 

tgc cca etc etc cgt ccc gec tec tec egg tgt aca ccc egg gac 
Cys Pro Leu Leu Arg Pro Ala Ser Ser Arg Cys Thr Pro Arg Asp 
260 265 270 

<210>22 
<211>43 
<212>DNA 

<213> Homo sapiens 
<400>22 

ggactcgagc caccaatgca ccagcgtcac ccccgggcaa gat 

<210>23 
<211>45 
<212>DNA 

<213> Homo sapiens 
<400>23 

tccgtcgact cattatcagg ttccttcctt tcccttccag gactg 

<210>24 
<211>30 
.<212>DNA 

<213> Homo sapiens 
<400>24 

ggttggcaat gtagacactg gtcaggttgg 

<210>25. 
<211>30 
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<212>DNA 

<2l3> Homo sapiens 
<400>25 

ccaacctgac cagtgtctac attgccaacc 



<210>26 
<211>30 
<212>DNA 

<213> Homo sapiens 
<400>26 

ctttccacct aggtgggcag ggctccaggc 



<210>27 
<2 1 1>30 
<212>DNA 

<213> Homo sapiens 
<400>27 . 

gcctggagcc ctgcccacct aggtggaaag 

<210>28 
<211>33 
<212>DNA 

<213> Homo sapiens 
<400>28 

tcgagaaaaa gtctacgggg gcctaggttt tta 



<210>29 
<211>33 
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<212>DNA 

<213> Homo sapiens 
<400>29 

agcttaaaaa cctaggcccc cgtagacttt ttc 

<210>30 
<211>30 
<212>DNA 

<213> Homo sapiens 
<4O0>30 

tcggccatgg ccgcaggcgg catcctacac 
<210>31 

<211>28 

<212>DNA 

<213> Homo sapiens 
<4O0>31 

ggcaagctta tcagcggggc gcggcgcc 

<210>32 

<211>564 

<212>DNA 

<213> Homo sapiens 
<400>32 

ccatggccgc aggcggcatc ctacacctgg agctgctggt ggccgtgggc cccgatgtct 
tccaggctca ccaggaggac acagagcgct atgtgctcac caacctcaac atcggggcag 
aactgcttcg ggacccgtcc ctgggggctc agtttcgggt gcacctggtg aagatggtca 
ttctgacaga gcctgagggt gctccaaata tcacagccaa cctcacctcg tccctgctga 
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gcgtctgtgg gtggagccag accatcaacc ctgaggacga cacggatcct ggccatgctg 
acctggtcct ctatatcact aggtttgacc tggagttgcc tgatggtaac cggcaggtgc 
ggggcgtcac ccagctgggc ggtgcctgct ccccaacctg gagctgcctc attaccgagg 
acactggctt cgacctggga gtcaccattg cccatgagat tgggcacagc ttcggcctgg 
agcacgacgg cgcgcccggc agcggctgcg gccccagcgg acacgtgatg gcttcggacg 
gcgccgcgcc ccgctgataa gctt 

<210>33 

<211>184 

<212>PRT 

<213> Homo sapiens 
<400>33 

Met Ala Ala Gly Gly He Leu His Leu Glu Leu Leu Val Ala Yal 

1 5 10 15 

Gly Pro Asp Yal Phe Gin Ala His Gin Glu Asp Thr Glu Arg Tyr 

20 25 30 

Val Leu Thr Asn Leu Asn He Gly Ala Glu Leu Leu Arg Asp Pro 

35 40 45 

Ser Leu Gly Ala Gin Phe Arg Val His Leu Val Lys Met Val lie 

50 55 60 

Leu Thr Glu Pro Glu Gly Ala Pro Asn lie Thr Ala Asn Leu Thr 

65 70 75 

Ser Ser Leu Leu Ser Val Cys Gly Trp Ser Gin Thr He Asn Pro 

80 85 90 

Glu Asp Asp Thr Asp Pro Gly His Ala Asp Leu Val Leu Tyr He 

95 100 105 

Thr Arg Phe Asp Leu Glu Leu Pro Asp Gly Asn Arg Gin Val Arg 

110 115 120 
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Gly Val Thr Gin Leu Gly Gly Ala Cys Ser Pro Thr Trp Ser Cys 

125 130 135 

Leu He Thr Glu Asp Thr Gly Phe Asp Leu Gly Val Thr lie Ala 

140 145 . 150 

His Glu He Gly His Ser Phe Gly Leu Glu His Asp Gly Ala Pro 

155 160 165 

Gly Ser Gly Cys Gly Pro Ser Gly His Val Met Ala Ser Asp Gly 

170 175 180 

Ala Ala Pro Arg 



<210>34 
<211>2529 
<212>DNA 
30 <213> Mus musculus 

<400>34 

atgagccagc tttgcctgtg gttgacgtgc cagccttgtt atgctgtcag 50 

35 

tgtcagagga atcctcactg gtgccatctt cattctgggc tgctgggggc 1.00 
tctctgactt ccagaagagt cttcttcaag atctggagcc caaggatgtg 150 
40 tcttcttact ttggccacca tgctgctcca ttcacaggcc atcctccctc 200 

tcacctccag agactgagac ggagaaggac tttggaggac attctgcacc 250 
tggaactcct ggtagctgtg ggccccgatg tttcccgggc tcatcaggag 300 

45 

gacacagaac gctacgtgct cactaatctc aatatcgggt cagaactgtt 350 
gagaaaccca tccctgggag tccagttcca ggtgcacctg gtgaagctaa 400 
50 tcaccctctc tgactcagag agtactccga atatcacggc caacatcacc 450 

tcatccttga tgagcgtctg cgagtggagc cagacgatca acccccacga 500 
tgacagggat ccaagtcacg ctgacctgat tctctatatc accagcaacg 550 

55 

tggctggtgc cactgtcctt gtgattcatt ttctcttatc aaggtttgac 600 
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ctggagttgc ctgatggcaa ccagcaggtt 

5 aggtgcctgc tccctttcct ggagttgcct 

ttgacctggg ggtcaccatc gcccatgaga 

gaccatgatg gtgctccagg tagtggcagc 

10 

cgtgatggcg gctgatggcg caacacctac 

ctgcctgcag ccaaaggcag ttgcagcacc 

15 cactgcttcc aggacccacc tgggctgcag 

gctgatggca cagcctggcc tctactacag 

tggctttcgg ttctggggct gtcgcctgca 

20 

aacacagcac tcagtggtcc ttccaccttg 

ccagaagtcc tggatggctc ctgaagctct 

25 aatccgacat ctggtctctg ggctgcatca 

tccttcctga acgacacaga agccatgcaa 

tcatccaggc agcctgaagc ccatcctgaa 

30 tccctggtac agatgtctac tatttgcttc 

aacccctccg atcgactggc aatcaaggat 

gagcaactcc ttcaaaagct cctctgttgc 

35 

aggtccccat cttcatcact gacgtgctgc 

atcttaggtg atggcagctg gctgtgtgct 
40 gcactgtgac tcagggattg gctcgcagag 

cagtctcttg gacagagcac cctctgaaag 

agtcgaccag aggtccagct cagagccatt 

45 

agaggaccag ctagcactgg caaaggaccc 

gcagtttgat catctccttc ctgatggata 

50 tctgaaaggc ttgttaatgt ggtctacaac 
ccaaggacag atctcagaag agctggaaga 

cccaagagaa cctggagcac ttccaagagg 

55 

atcctgagcc tgctctggtc cctcctggta 



cggggtgtca 


cccagctggg 


650 


tatcactgag 


gatactggct 


700 


ttgggcacag 


cttcgggctg 


750 


acctgcaagg 


ccagtggcca 


800 


tggagggacc 


ctggagtggt 


850 


tactcagcac 


agggcaaatg 


900 


tcaggactta 


cacggcacca 


950 


tgcagatgat 


cagtgccgtg 


1000 


ccttctccag 


ggagggtctg 


1050 


atcctgtccg 


cagacccctg 


1100 


caaattctcc 


ttctccacca 


1150 


ttetagacat 


ggccacttgc 


1200 


ctgcggaagg 


ccatccgcca 


1250 


aaccatggag 


gagaagcaaa 


1300 


tgcccttcat 


gttgcatatc 


1350 


gtgatgcaag 


tcaccttcat 


1400 


gctgaatatg 


cagcggcaga 


1450 


t tgaaggcaa 


catggccaac 


1500 


tcctttgtga 


acgacagcag 


1550 


acttgggttt 


gatt ttcagt 


1600 


atgtcatgca 


gaat ttctcc 


1650 


aacaagttgt 


tgacaatgcc 


1700 


agaagctgag 


atcccaagga 


1750 


ccttgcggag 


ccatcctaac 


1800 


gtgcttgcca 


ttatttccag 


1850 


ggaggggttg 


tttcagcttg 


1900 


acagggacat 


ctgcctctct 


1950 


gatgttgtca 


ctgtggacaa 


2000 
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.agagcccttg gagcagctct ctggcatggt cacctgggtg ctggctactc 2050 

atccggagga cgtggaaata gcagaggctg gctgtgcggt gctctggctg 2100 

ctgtccttgt tgggctgcat aaaggagagt cagtttgagc aggtggtagt 2150 

gctgctcctg agaagcatcc agctgtgccc tggcagagta ctgctggtga 2200 

acaatgcatt ccgtggcttg gccagcctcg caaaggtgtc cggcccaccc 2250 

tcacagttag agccaaatga ctgggtatcc agccccagcc cccttttgtg 2300 

gaatcagaga cttcactatg tgaacaagca aaagctgttc atgcctctgt 2350 

gggtgctgag gcaagagcac cctcattact gctgtgctaa tgaccctaca 2400 

tcagagcaca tccaggcagt actaagtgga ctaaatgggt ttgaaaagaa 2450 

gcacagttgt gtggaatctt gtgtggaatg tggctgcagg cagcaggaga 2500 

agaatagagg aggagcccca gggatttga 2529 



<210>35 
<2 11>2514 . 
<212>DNA 

<213> Mus musculus 
<400>35 



aggaagctcc caagagtaaa cactgcctga tgtcccgccc agccagcaag 50 

tgaacattgc acactaacca gaatcccagt cactagggct cctgtccggc 100 

catcaactgc cttttctaaa gatgagccag ctttgcctgt ggttgacgtg 150 

ccagccttgt tatgctgtca gtgtcagagg aatcctcact ggtgccatct 200 

tcattctggg ctgctggggg ctctctgact tccagaagag tcttcttcaa 250 

gatctggagc ccaaggatgt gtcttct tac tttggccacc atgctgctcc 300 

attcacaggc catcctccct ctcacctcca gagactgaga cggagaagga 350 

ctttggagga cattctgcac ctggaactcc tggtagctgt gggccccgat 400 

gtttcccggg ctcatcagga ggacacagaa cgctacgtgc tcactaatct 450 

caatatcggg tcagaactgt tgagaaaccc atccctggga gtccagttcc 500 

aggtgcacct ggtgaagcta atcaccctct ctgactcaga gagtactccg 550 
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aatatcacgg ccaacatcac ctcatccttg 
ccagacgatc aacccccacg atgacaggga 

5 

ttctctatat caccaggttt gacctggagt 
gttcggggtg tcacccagct gggaggtgcc 
10 ccttatcact gaggatactg gctttgacct 

agattgggca cagcttcggg ctggaccatg 
agcacctgca aggccagtgg ccacgtgatg 

15 

tactggaggg accctggagt ggtctgcctg 
acctactcag cacagggcag atgcactgct 
20 cagtcaggac ttacacggca ccagctgatg 

cagtgcagat gatcagtgcc gtgtggcttt 
gcaccttctc cagggagggt ctggatgtat 

25 

acagaccccc tggaccaaag cagctgcagc 
ggatgggaca ggatgtggtg tggagaagtg 
30 gctccctagc tgagctggct cctgtggctg 

agctggggcc cccatagtcc ctgctcccga 
taccaggagg cggtggtgca acaaccccag 

35 

catgtgtggg tgaagacctc caggctaaga 
gagaagactc agctggagtt catgtccgag 
40 acaaccactg caactttccc aaggcactgc 

ctgctgtgca gtatagtcaa ggagataccc 
gctgttggag aaagcttcat tgtcagccgt 
gacccgttgt gtgccaagtg gtccccagga 
gtttgttggg cagctgcagg acctttggct 
cagaaggttt gggatgcgtg ccaggtgtgt 

50 

cagctcacgg aatggttctt tcacagctgg 
cgttcctgat tgttactccc aacatgacca 
55 aggcctctct tcacacactt ggcggtgagg 



O t CT 1 CT C CT f C* f 

aigagcgic i 


fT/»<TO <7 f Ct CT ^ CT 

gcgagtggag 


enn 
oUU 


tccaagicac 


etc* \ cr*i c* c* f ci o 

gcigacc iga 


ccn 
DDU 


f CT C* C* f CT O t CTCT 

igcctgaigg 


caaccagcag 


7nn 


f cr c* f /* r> c* t f f 

IgC ICCC l I I 


r+C* k CTCT1 CT f f IT 

cc 1 ggagi ig 


f b(J 


(ret cr fret f 

ggggg tcacc 


O t C* CTC f*f» O t CT 

atcgccca ig 


q An 


r\ f cr rr f etc* f c* c* 

a iggigcicc 


r\ ctct f orrf CTCTC* 

aggiagtggc 


o cn 
obU 


gcggctgatg 


gcgcaacacc 


a An 
iJOO 


cagecaaagg 


cagttgeage 


950 


tccaggaccc 


acctgggctg 


1000 


gcacagcctg 


gcctctacta 


1050 


cggttctggg 


gctgtcgcct 


1100 


gccaggccct 


gtcctgccac 


i 150 


cgcctccttg 


ttcctctcct 


1200 


gtgctccaag 


gctcgctgtc . . 


1250 


cagtacatgg 


acactggtct 


1300 


tcctgtggag 


gaggtgtgat 


1350 


gectgeatt t 


gggggacgtg 


1400 


tgtgcaacac 


gcaggcttgt 


1450 


cagtgtgccc 


agacagacag 


1500 


ctcct tctac 


cactgggatg 


1550 


tgtgcagaca 


catgtgctgg 


1600 


ggggacaggt 


tcctagatgg 


1 C C A 

1650 


tgatgggacc 


ctaagcctct 


1700 


gtgatggcag 


gatggactcc 


i Ten 

1750 


ggaggagaca 


acagcacctg 


1800 


gagagecaga 


gaatatgtca 


1850 


acgcacacat 


tgtcaaccgc 


1900 


atccagggcc 


actacat tgt 


1950 
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ggcagggaag actagcatct cacccaacac cacctaccct tcccttctgg 
aggactaccg tgtggaatac agagtgactc tcactgagga ccagctgccc 
cacttagagg agattcacat ccggggaccc gtccgggatg acattgagat 
tcaggtgtac agacgatatg gaggagaata tggggatctt acacacccag 
acatcacctt ttcctacttt caactgaagc agcaggcagc ctgggtatgg 
accgctaagc gtggaccctg ctcagtgagc tgtggggcag ggctgcgctg 
ggtgacctac agctgccagg atcaagctca agacaagtgg gtaaagaacg 
cccagtgcca agggagccca cagccacctg catggcaaga gccttgtgtc 
tctgccccct gctccccata ttgggtagct ggggacttca gcccatgtag 
cgtgtcttgt ggcgggggcc ttcgggagcg gtcactgcgc tgtgtagaga 
cccaagatgg cttcttaaag acactgccac ctgcccggtg cagagcagta 
gcccagcagc cage 

<210>36 

<211>3512 

<212>DNA 

<213> Mus musculus 
<400>36 

aggaagctcc caagagtaaa cactgcctga tgtcccgccc agecagcaag 
tgaacattgc acactaacca gaatcccagt cactagggct cctgtccggc 
catcaactgc cttttctaaa gatgagecag etttgectgt ggttgacgtg 
ccagccttgt tatgc tgtca gtgtcagagg aatcctcact ggtgccatct 
tcattctggg ctgctggggg ctctctgact tccagaagag tcttcttcaa 
gatctggagc ccaaggatgt gtcttcttac tttggccacc atgctgctcc 
attcacaggc catcctccct ctcacctcca gagactgaga eggagaagga 
ctttggagga cattctgcac ctggaactcc tggtagctgt gggccccgat 
gtttcccggg ctcatcagga ggacacagaa cgctacgtgc tcactaatct 
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caatatcggg tcagaactgt tgagaaaccc atccctggga gtccagttcc 
aggtgcacct ggtgaagcta atcaccctct ctgactcaga gagtactccg 
aatatcacgg ccaacatcac ctcatccttg atgagcgtct gcgagtggag 
ccagacgatc aacccccacg atgacaggga tccaagtcac gctgacctga 
ttctctatat caccaggttt gacctggagt tgcctgatgg caaccagcag 
gttcggggtg tcacccagct gggaggtgcc tgctcccttt cctggagttg 
ccttatcact gaggatactg gctttgacct gggggtcacc atcgcccatg 
agattgggca cagcttcggg ctggaccatg atggtgctcc aggtagtggc 
agcacctgca aggccagtgg ccacgtgatg gcggctgacg gcgcaacacc 
cactggaggg accctggagt ggtctgcctg cagccaaagg cagttgcagc 
acctactcag cacagggcaa atgcactgct tccaggaccc acctgggctg 
cagtcaggac ttaeaeggca ccagetgatg gcacagcctg gcctctacta 
cagtgcagat gatcagtgcc gtgtggcttt cggttctggg gctgtcgcct 
gcaccttctc cagggagggt ctggatgtat gccaggccct gtcctgccac 
acagacccct tggaccaaag cagctgcagc cgcctccttg ttcctctcct 
ggatgggaca gaatgtggtg tggagaagtg gtgctccaag gctcgctgtc 
gctccctagc tgagctggct cctgtggctg cagtacatgg acactggtct 
agctggggcc cccatagtcc ctgctcccga tcctgtggag gaggtgtgat 
taccaggagg cggtggtgca acaaccccag gcctgcattt gggggacgtg 
catgtgtggg tgaagacctc caggctaaga tgtgcaacac gcaggcttgt 
gagaagactc agctggagtt catgtccgag cagtgtgccc agacagacag 
acaaccactg caactttccc aaggcactgc ctccttctac cactgggatg 
ctgctgtgca gtatagtcaa ggagataccc tgtgcagaca catgtgctgg 
gctgttggag aaagcttcat tgtcagccgt ggggacaggt tcctagatgg 
gacccgttgt gtgccaagtg gtcctcagga tgatgggacc ctaagcctct 
gtttgttggg cagctgcagg acctttggct gtgatggcag gatggactcc 
cagaaggttt gggatgcgtg ccaggtgtgt ggaggagaca acagcacctg 
cagctcacgg aatggttctt tcacagctgg gagagccaga gaatatgtca 
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gttggtaaac aaataatctg cgcatgagcc aagggtattt acgactactt 
gtactctgtt tttcccgtga acgtcagctc ggccatgggc tgcagccaat 
cagggagtga tgcgtcctag gcaattgttg ttctctttta aatagaaggg 
gtttcgtttt tctctttttc ttgcttctta cactctggcc ccaaaaagat 
gtaagcaata aagctttgcc gtaggaaaaa aaaaaaaaaa ggatccggta 
cctctagatc ag 

<210>37 
<211>22 
<212>PRT 

<2l3> Homo sapiens 
<400>37 

Phe. Ser Pro Ala. Pro Gin Pro Arg Arg Leu Leu Pro Gly Pro Gin 
1 5 10 15 

Glu Asn Ser Val Gin Ser Ser 
20 

<210>38 
<211>30 
<212>DNA 

<213> Homo sapiens 
<400>38 

atgtgcaaca ctcaggcctg cgagaagacc 30 



3300 
3350 
3400 
3450 
3500 
3512 



<210>39 
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<211>30 
<212>DNA 

<213> Homo sapiens 
<400>39 

ccaacctgac cagtgtctac attgccaacc 



<210>40 
<211>21 
<212>DNA 

<213> Homo sapiens 
<400>40 

ctggagccct gcccacctag g 



<210>41 
<211>62 
<212>DNA 

<213> Homo sapiens 
<400>41 

gccgtcgact cttatcactt atcgtcatcg tccttgtagt cttgcgacat gaactccagc 
tg 



<210>42 
<211>62 
<212>DNA 

<213> Homo sapiens 
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<400>42 

gccgtcgact cttatcactt atcgtcatcg tccttgtagt ccaggttggg ggtaactgtc 
ag 

<210>43 
<211>62 
<212>DNA 

<213> Homo sapiens 
<400>43 

gccgtcgact cttatcactt atcgtcatcg tccttgtagt ccacgtgtgc agcttgagcc 
cc 



<210>44 
<211>62 
<212>DNA 

<213> Homo sapiens 
<400>44 

gccgtcgact cttatcactt atcgtcatcg tccttgtagt ccctaggtgg gcagggctcc 
ag 



<210>45 
<211>62 
<212>DNA 

<213> Homo sapiens 
<400>45 
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gccgtcgact cttatcactt atcgtcatcg tccttgtagt caccctgtcc cacacagggc 
cc 

<210>46 
<211>60 
<212>DNA 

<213> Homo sapiens 
<400>46 

tccaagcttg tcgactctta tcacttatcg tcatcgtcct tgtagtcggt tccttccttt 
<210>47 

<211>27 

<212>DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Synthet ic DNA 
<400>47 

gactacaagg acgatgacga taagtga 

<210>48 

<211>8 

<212>RPT 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Seauence: Synthetic 
<400>47 

Asp Tyr Lys Asp Asp Asp Asp Lys 
1 5 



Claims 



1. A protease that is capable of cleaving a bond between residues Tyr-842 and Met-843 of von Willebrand factor 
(hereinafter referred to as "vWF") and comprises a polypeptide chain having the amino acid sequence Leu-Leu- 
Val-Ala-Val as a partial sequence or an amino acid sequence with deletion, substitution, or addition of one or 
several amino acids in said amino acid sequence. 

2. The protease according to claim 1 , which comprises a polypeptide chain having the amino acid sequence Ala-Ala- 
Gly-Gly-lle-Leu-His-Leu-Glu-Leu-Leu-Val-Ala-Val as the N-terminal partial sequence of a mature protein or an 
amino acid sequence with deletion, substitution, or addition of one or several amino acids in said amino acid 
sequence. 

3. The protease according to claim 1 or 2, which comprises a polypeptide chain having an amino acid sequence with 
deletion, substitution, or addition of one or several amino acids in the amino acid sequence as shown in SEQ ID 
NO: 3 or 7 or a partial sequence of any of the aforementioned amino acid sequences as the N-terminal partial 
sequence of a mature protein or the aforementioned amino acid sequence. 

4. The protease according to any one of claims 1 to 3, which comprises a polypeptide chain having an amino acid 
sequence with deletion, substitution, or addition of one or several amino acids in the amino acid sequence as 
shown in any of SEQ ID NOs: 16 to 21. 

5. The protease according to any one of claims 1 to 4, which has molecular weight of 1 05 to 1 60 kDa or 1 60 to 250 
kDa in SDS-PAGE under reducing or non-reducing conditions. 

6. A gene fragment encoding a protease that is capable of cleaving a bond between residues Tyr-842 and Met-843 
of vWF and comprises a polypeptide chain having the amino acid sequence Leu-Leu-Val-Ala-Val as a partial se- 
quence or an amino acid sequence with deletion, substitution, or addition of one or several amino acids in said 
amino acid sequence. 

7. A gene fragment encoding the protease according to any one of claims 2 to 5. 

8. DNA encoding the protease according to any one of claims 1 to 5, which comprises a nucleotide sequence encoding 
a polypeptide capable of cleaving a bond between residues Tyr 842 and Met 843 of vWF comprising CTG CTG 
GTG GCC GTG or with deletion, substitution, or addition of one or several nucleotides therein. 



9. The DNA encoding a protease according to claim 8, which comprises a nucleotide sequence comprising GCT 
GCA GGC GGC ATC CTA CAC CTG GAG CTG CTG GTG GCC GTG or with deletion, substitution, or addition of 
one or several nucleotides therein. 



10. The DNA encoding a protease according to claim 8 or 9, which comprises a nucleotide sequence with deletion, 
substitution or addition of one or several nucleotides in the nucleotide sequence as shown in SEQ ID NO: 6 or a 
partial sequence of any of the nucleotide sequences, or the nucleotide sequence. 
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11. The DNA encoding a protease according to any one of claims 8 to 10, which comprises a nucleotide sequence 
with deletion, substitution or addition of one or several nucleotides in the nucleotide sequence as shown in SEQ 
ID NO: 15 or a partial sequence of any of the nucleotide sequences, or the nucleotide sequence. 

12. A vector comprising the DNA encoding a protease according to claim 8 or 9, which comprises a nucleotide se- 
quence with deletion, substitution or addition of one or several nucleotides in the nucleotide sequence as shown 
in SEQ ID NO: 6 or 15 or a partial sequence of any of the nucleotide sequences, or the nucleotide sequence. 

13. The vector according to claim 12 comprising a polypeptide encoding domain and specialized in the expression of 
said polypeptide. 

14. A cell transformed or transfected with the vector according to claim 12. 

15. A host cell transformed or transfected with the expression vector according to claim 13. 

16. A pharmaceutical composition comprising the protease according to any one of claims 1 to 5. 

17. The pharmaceutical composition according to claim 16, which is applied to treating diseases caused by deterio- 
ration in activity of the protease according to any one of claims 1 to 5, which is involved with gene defects or liver 
diseases. 

18. The pharmaceutical composition according to claim 16 or 17, which is applied to the inhibition of platelet aggre- 
gation caused by the formation of excess vWF high-molecular-weight multimers. 

19. The pharmaceutical composition according to claim 18, wherein the disease is thrombotic thrombocytopenic pur- 
pura. 

20. An antibody against the protease according to any one of claims 1 to 5. 

21. The antibody according to claim 20 against the protease according to any one of claims 1 to 5, which is capable 
of inhibiting or neutralizing the protease activity. 

22. The antibody according to claim 20 against the protease according to any one of claims 1 to 5, which can be used 
for affinity purification of the protease. 

23. A process for purifying the protease according to any one of claims 1 to 5, which utilizes the antibody according 
to claim 22. 

24. A pharmaceutical composition or diagnostic agent comprising an antibody against the protease according to any 
one of claims 1 to 5. 

25. An antagonist, inhibitor, agonist, or activity regulator against the protease according to any one of claims 1 to 5. 

26. A pharmaceutical composition or diagnostic agent comprising an antagonist, inhibitor, agonist, or activity regulator 
against the protease according to any one of claims 1 to 5. 

27. A pharmaceutical composition or diagnostic agent comprising the DNA according to any one of claims 8 to 11 or 
antisense DNA thereof. 

28. The pharmaceutical composition according to claim 27, which is used for gene therapy intended to cure diseases 
caused by deterioration in activity of the protease according to any one of claims 1 to 5, which is involved with 
gene defects or liver diseases. 

29. A process for assaying vWF-cleaving activity, wherein a protease-substrate reaction is carried out using vWF and 
vWF-cleaving protease on a membrane filter, and a substrate sample is then recovered from the filter, followed by 
SDS-PAGE analysis without Western blotting. 

30. A process for screening for a compound capable of cleaving vWF, wherein the vWF-cleaving activity of a test 
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compound is assayed by the process according to claim 29. 

31 . A process for preparing the protease according to any one of claims 1 to 5, wherein human plasma fraction I paste 
is used as a starting material. 

32. A homologue of the protease according to any one of claims 1 to 5 derived from a different animal species or a 
homologous protein thereof. 

33. A gene encoding the homologue of the protein according to claim 32 derived from a different animal species or a 
homologous protein thereof. 

34. An animal having a modified gene encoding the homologue of the protein according to claim 32 derived from a 
different animal species or a homologous protein thereof. 
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FIG 1 
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FIG 3 
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FIG 4 
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FIG 5A 




96 



EP 1 391 516 A1 



FIG 5B 
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FIG 5C 
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FIG6A 
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FIG 6B 
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FIG 6C 
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FIG 7 
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FIG 8A 
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FIG 8B 
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FIG 9 



get gca ggc ggc ate eta cac ctg gag ctg ctg gtg gec gtg ggc 
Ala Ala Gly Gly lie Leu His Leu Glu Leu Leu Val Ala Val Gly 
15 10 15 

ccc gat gtc ttc cag get cac cag aag gac aca gag cgc tat gtg 
Pro Asp Val Phe Gin Ala His Gin Lys Asp Thr Glu Arg Tyr Val 

20 25 30 

etc acc aac etc aac ate ggg gca gaa ctg ctt egg gac ccg tec 
Leu Thr Asn Leu Asn lie Gly Ala Glu Leu Leu Arg Asp Pro Ser 

35 40 45 

ctg ggg get cag ttt egg gtg cac ctg gtg aag atg gtc att ctg 
Leu Gly Ala Gin Phe Arg Val His Leu Val Lys Met Val He Leu 

50 55 60 

aca gag cct gag ggt get cca aat ate aca gca aac etc acc teg 
Thr Glu Pro Glu Gly Ala Pro Asn He Thr Ala Asn Leu Thr Ser 

65 70 75 

tec ctg ctg age gtc tgt ggg tgg age cag acc ate aac cct gag 
Ser Leu Leu Ser Val Cys Gly Trp Ser Gin Thr He Asn Pro Glu 

80 85 90 

gac gac acg gat cct ggc cat get gac ctg gtc etc tat ate act 
Asp Asp Thr Asp Pro Gly His Ala Asp Leu Val Leu Tyr He Thr 
95 100 105 

agg ttt gac ctg gag ttg cct gat ggt aac egg cag gtg egg ggc 
Arg Phe Asp Leu Glu Leu Pro Asp Gly Asn Arg Gin Val Arg Gly 
110 . 115 120 

gtc acc cag ctg ggc ggt gec tgc tec cca acc tgg age tgc etc 
Val Thr Gin Leu Gly Gly Ala Cys Ser Pro Thr Trp Ser Cys Leu 
125 130 135 

att acc gag gac act ggc ttc gac ctg gga gtc acc att gec cat 
He Thr Glu Asp Thr Gly Phe Asp Leu Gly Val Thr He Ala His 
140 145 150 

gag att ggg cac age ttc ggc ctg gag cac gac 
Glu He Gly His Ser Phe Gly Leu Glu His Asp 
155 160 
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FIG 10 
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FIG 11 



gctgcaggcg gcatcctaca cctggagctg ctggtggccg tgggccccga tgtcttccag 
Primer 1 

gctcaccaga aggacacaga gcgctatgtg ctcaccaacc tcaacatcgg ggcagaactg 

Primer 3 

cttcgggacc cgtccctggg ggctcagttt cgggtgcacc tggtgaagat ggtcattctg 

acagagcctg agggtgctcc aaatatcaca gcaaacctca cctcgtccct gctgagcgtc 

tgtgggtgga gccagac cat caaccctgag gacgacac gg atcctggcca tgctgacctg 

Primer 4 

gtcctctata tcactaggtt tgacctggag ttgcctgatg gtaaccggca ggtgcggggc 

gtcacccagc tgggcggtgc ctgctcccca acctggagct gcctcattac cgaggacact 

ggcttcgacc tgggagtc ag cattgcccat gagattggg c acagcttcgg cctggagcac 

Primer 2 

gac 
Primer 1 

Sense: gctgcaggcg gcatcctaca cctggagctg 
Antisense - cagctccagg tgtaggatgc cgcctgcagc 

Primer 2 

Sense : accattgccc atgagattgg g 
Antisense : cccaatctca tgggcaatgg t 

Primer 3 

Sense : gcgctatgtg ctcaccaacc tcaacatcgg 
Antisense : ccgatgttga ggttggtgag cacatagcgc 

Primer 4 

Sense : atcaaccctg aggacgacac 
Antisense : gtgtcgtcct cagggttgat 
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Primer 1 

Xhol site 



FIG 12 

About 4.3 kb 
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Fragment 1 
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► -^ ■ 
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Primer 3 



Primer 4 

AccI site 



Primer 6 

Primer 5 

Avrll site 
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Primer 2 

Sal! site 



Xhol 




FIG 13 



Hindlll 



Xhol/AccyAvrlimindlll linker 



pCR2.1 about4kbp 




XhoI/AccI digestion 

Ligation 



Fragment 1 TA cloning XhoI/AccI digestion 



pCR fragment 1 (about 5.8 kbp) 



Accl/Avrll digestion 

Ligation 



Fragment 2 TA cloning Acc/Avrll digestion 



pCR fragments 1 + 2 (about 7.1 kbp) 



Avrll/HindlH digestion 

Ligation 



u 



Fragment 3 TA cloning Avrll/Hindlll digestion 



pCR fragments 1+2 + 3 (about 8.3 kbp) 



Xhol 



Accl 



Avrll 



pCR fragments 1+2 + 3 (about 8.3 kbp) 
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FIG 14 
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FIG 17 
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FIG 18 
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FIG 19 




293 cell culture supernatant (1/dilution factor) 
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FIG 20 
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FIG 21 
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FIG 22 



Full-length gene (about 4.3 kb) 



Stu I -S 



Stu I -S 



0449 stop-AS (A) 
^^1581 stop-AS (B) 
Acc 1 -S J^** 



W897 stop-AS (C) 
AvrII-S^ W1016 stop-AS (D) 

AvrH-S^^ 
Avrll-S 



■T1135 stop-AS (E) 



Antisense primer for C-terminus-lacking vector 



FL*G. 



^K: Sail site 



gactacaaggacgatgacgataagtga 
DYKDDDDK* 



Full length-AS (F) 



Primer sequence 
SUiI-S:SEQIDNO:38 
AccI-S:SEQIDNO:39 
AvrH-S: SEQIDNO:40 
Q449stop-AS : SEQ ID NO: 4 1 
T581 stop-AS: SEQ ID NO: 42 
W897stop-AS: SEQ ID NO: 43 
W1 01 6stop- AS : SEQ ED NO: 44 
T1l35stop-AS: SEQ ID NO: 45 
Full length- AS : SEQ ID NO: 46 



Ligation of each PCR fragments (A) to (F) 




I I I 



pCR 2.tvWFCF 



Stul/Sall digestion 



Stul/SaU digestion and ligation 
of fragments (A) and (B) 



AccI digestion 



AccI digestion and ligation 
of fragment (Q 




Avrll/SaiJ digestion 



Avrll/SaU digestion and ligation 
of fragments (D), (E), and (F) 
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